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A STUDY OF SIMPLE LEARNING UNDER IRRELEVANT 
MOTIVATIONAL-REWARD CONDITIONS 


BY K. W. SPENCE, G. BERGMANN, AND R. LIPPITT 
From the Psychological Laboratory, University of Iowa 


INTRODUCTION 


In the first report (22) of a series of 
experimental studies concerned with 
the motivational-reward conditions 
underlying learning in animals, the 
sign-gestalt (non-reinforcement) the- 
ory of trial-and-error learning as for- 
mulated by Tolman (25) and others 
(12, 29) was subjected to experimental 
test. According to this theory such 
selective learning involves the acquisi- 
tion of sign-gestalt-expectations with 
respect to the various aspects (signs) 
of the situation, the alternative re- 
sponses and the different significates 
to which each of the latter leads. 
These expectation-sets (also termed 
cognitions, hypotheses, etc.) consti- 
tute one of Tolman’s so-called inter- 
vening or theoretical variables which 
are assumed to be determinants of be- 
havior. Thus, in the alternative 
choice situation typical of simple trial- 
and-error learning, the response on any 
trial will depend upon the state of 
these cognitions (C) and another inter- 
vening variable, motivational state 
(M) or, more briefly, B = f(C, M). 

Cognition (C) and motivation (M) 
are, in Tolman’s system, hypothetical 
state variables of the organism which 


are to be specified in terms of certain 
past experimental operations or con- 
ditions. The particular point at issue 
in our experiment was Tolman’s hy- 
pothesis as to the conditions essential 
to the development of the cognitive 
states. Tolman and his group have 
taken the position that this learning 
factor (i.e., cognition) develops as the 
result of mere association by contigu- 
ity (sequence in experience) of the 
sign, the response and the significate, 
and that such learning is not due, as 
the opposing reinforcement theorists 
have claimed, to any rewarding prop- 
erty possessed by a significate. That 
is, according to this view, the learning 
of sign-significate expectations is not 
dependent upon their being a reward 
or goal object for which the organism 
is at the moment motivated. Any 
differential reward values among the 
significates will, Tolman admits, make 
for a difference in the response of the 
organism at any moment, but they 
play little or no part in the learning 
per se—i.e., the formation of the 
expectation-sets. 


As a test of this theoretical position 10 
white rats were run in a simple two-choice 
Y-maze, one alley (left) of which led to a 
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goal box containing food and the other 
(right) to a goal box containing a water 
bottle. The Ss, motivated for water and 
satiated for food, were given training for 
12 days, in which each S§ was given five 
trials per day. In two of the five trials 
the 5's were free to choose either path and 
the remaining three trials were forced, 
one to the water and two to the food side. 

On the day following the completion of 
the learning series, a critical test was 
given in which the motivation of the Ss 
was shifted from thirst to hunger. Ac- 
cording to Tolman’s hypothesis, the Ss 
should have learned what object (food or 
water) each of the two alleys led to and 
when motivated for food they should 
have been expected to reveal this learning 
by taking the alley leading to the food. 
The results of the test, however, were en- 
tirely negative to this theoretical impli- 
cation, for every animal, although hun- 
gry, chose the water alley. Futhermore, 
it was found that this group of subjects 
subsequently were unable to learn to go 
to the food alley more readily than a com- 
parison group which had had the same 
previous training experience of 12 days, 
except that no food had ever been present 
in the left goal box. Both of these re- 
sults are in line, it may be noted, with the 
implications of the reinforcement type of 
theory such as that of Hull (6) and 
Thorndike. 

A second study in this series essentially 
similar in design to this investigation has 
been carried out by Kendler (9). The 
maze employed in this instance was of 
the simple T-type and further differenti- 
ation of the critical signs was achieved 
by having one of the alleys painted black 
while the other remained unpainted. Six 
of the subjects were trained under thirst, 
as in the Spence and Lippitt study and 
four under hunger drive. In the test 
series involving a shift of motivation all 
subjects continued to run to the side 
which had led to the appropriate reward 
object during the training series." 


1 The results for Kendler’s subjects that were 
trained under hunger in the learning series and 
then tested under thirst effectively refute the 
recent criticism of Hayes (2). The latter 
claimed that the subjects of Spence and Lippitt 
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While the findings of these experi- 
ments were quite contrary to what 
Tolman’s theory would lead one to 
expect, it was suggested that certain 
modifications might be made in his 
formulation that would permit one to 
maintain a non-reinforcement position 
and yet account for the present experi- 
mental findings satisfactorily. One 
possibility that suggested itself was 
that the learning involved, in addi- 
tion to, or possibly even instead of. 
the sign-significate expectations with 
respect to food and water, the forma- 
tion of the expectations that the right- 
side alley led to a need-satisfying situ- 
ation without differentiation as to its 
specific nature, whereas the left alley 
led to the expectation of failure of 
need satisfaction. Such ‘knowledge,’ 
no matter what the motivational state, 
would presumably lead to the taking 
of the alley on the right during the 
test trial. 


Statement of Problem 


The experiment described in the 
present report attempted to eliminate 
the differential need-satisfying char- 
acter of the two end boxes that existed 
in our first study. One obvious way 
of accomplishing this was to have the 
Ss satiated for both food and water 
during the training runs. A serious 
difficulty with such a procedure is 
readily apparent, however, in that one 


had had no experience with a strong hunger drive 
prior to the test and could not be expected to 
react appropriately toward food on the test trial. 
Hayes claims to have shown that rats trained 
originally on food did respond appropriately to 
water on the first test trial. He suggested, there- 
fore, that the drive sequence training on hunger, 
testing on thirst would be adequate. The identi- 
cal results of Kendler under both conditions re- 
fute Hayes’ criticism. Furthermore, the results 
of the present experiment, as will be seen, seri- 
ously call into doubt Hayes’ claim that rats will 
not be able to respond appropriately when 
tested under hunger. 
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does not have any drive state that 
will motivate the animal to make the 
necessary responses in the situation. 
In the hope that some mild form of 
motivation (possibly an exploratory 
drive) might operate long enough to 
make the animals enter the alleys and 
end boxes a sufficient number of trials 
during the training period, a group of 
Ss was started under this condition of 
satiation for both food and water. 
Furthermore, in order not to have the 
end boxes serve as a possible goal- 
situation for the animals by reason 
of the fact that entrance into them 
led to their being picked up and placed 
in their home cage or a cage with other 
animals (social goal), the procedure 
was followed of placing each S, on re- 
moval from the goal box, in the same 
type of small individual carrying cage 
that it was in prior to the start of the 
run. If such a situation would lead 


to the necessary behavior taking 
place, it was felt that it would provide 


the ideal experimental conditions, for 
the necessary sequence of events (Sz, 
—R,—Sp, Sr—Rr—Sw) would have 
occurred without presumably any goal 
reinforcement at the end boxes. 


Unfortunately, the exploratory drive, 
or whatever the motivation operating at 
first was, did not persist sufficiently long 
to provide for an adequate number of ex- 
periences in the maze alleys. After mak- 
ing two or three ‘exploratory’ runs into 
the alleys the Ss refused to run in the 
maze but instead would lie down in the 
starting alley and apparently go to sleep. 
Some attempt was made to keep the 
Ss moving by gently pushing them, but 
this was abandoned as unsuccessful. 

With this failure to obtain satisfactory 
runs in the training condition without the 
use of some goal situation, it was decided 
to introduce a goal situation that would 
be identical (equalized) for both end 
boxes. The goal situation hit upon wasa 
cage similar to the home cage of the ani- 
mals, in which all except the S run first 
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on the day found one or more of his home- 
cage mates. As the order of running the 
Ss was changed each day, all animals 
found themselves, being placed in this 
‘social’ situation after reaching one or 
other of the end boxes on most of their 
trials. This procedure apparently pro- 
vided for some type of motivation and 
goal reward, for under it the subjects did 
perform much more satisfactorily than in 
the case of the first group of satiated sub- 
jects. On the occasions that an S would 
not run in a training trial, it was prodded 
until the run was completed. There 
were considerable individual differences 
in the extent to which the Ss were moti- 
vated to get to the end box and eventu- 
ally to the social cage, and on the whole 
the times taken to run were longer than 
in the first experiment in which the Ss 
were motivated by thirst. 


The fact that the conduct of the 
experiment required a motivating con- 
dition and a goal situation is of no 
little significance for the theoretical 
point at issue. For one thing, it 
meant that the arrangement did not 
provide for the sequence in experience 
of the sign, response, and significate 
without the occurrence of a reward or 
goal situation following the sequence. 
Thus, the question of whether the 
mere contiguity of these events in 
experience without a reinforcing state 
of affairs is sufficient for the establish- 
ment of the sign-gestalt-expectations 
with respect to food and water cannot 
be answered by this experimental de- 
sign. Our experiment can only tell us 
whether expectation-sets concerning 
significates for which there is not at 
the moment any need, can be estab- 
lished on the basis of some other mo- 
tivating state and accompanying sig- 
nificate which does have reward value. 
For example, is the white rat able to 
develop expectations with respect to 
the food and water while motivated 
only for the ‘social’ cage which it gets 
to upon entering either of the end 
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boxes? It is apparent, then, that 
while we have equalized the reward 
values of the two end boxes, we are 
not testing the question of whether 
sign-gestalt expectations can be estab- 
lished independently of motivation and 
reward. 


EXPERIMENTAL ProcepuRE 


Subjects —The Ss, 39 white rats from the 
departmental colony, consisted of 15 males and 
24 females. Ages at the beginning of the pre- 
liminary training varied slightly around 90 days. 

Apparatus.—A ground plan and detailed 
description of the apparatus was given in the 
earlier article (22). It is sufficient to say here 
that it was a single-choice Y-maze with a rela- 
tively large square area at the choice point and a 
jump of two in. at the entrance to each of the 
alleys. One of the goal boxes was fitted with a 
water bottle similar in design and appearance to 
the type used in the living cages of the animals. 
The food goal box contained a sliding food tray 
in the floor but this was more or less completely 
covered by some six to eight large pellets of 
Purina Dog Chow spread over the floor. 

Vertically sliding doors, operated from behind 
a one-way screen, prevented the Ss from secing 
the objects in the goal boxes and from retracing 
in the maze. Differentiation of the two alleys 
from one another was increased by having the 
floor at the entrance to the left alley covered 
with fine hardware cloth while the entrance to 
the right alley was covered with a very much 
coarser screening. 

Preliminary training.—After several days of 
handling, the Ss were given an opportunity to 
explore and become familiar with the maze. 
During this experience they were satiated for 
both food and water. Six to ten Ss were placed 
in the maze at the same time and were allowed to 
explore (1) the starting box, choice chamber and 
one alley, (2) the second alley, and (3) both 
alleys simultaneously for about 30 to 40 min., 
respectively. During this period the water 
bottle was removed and the end of the box con- 
taining it covered over. No food was in the food 
goal box. 

On the following day, each S was given two 
runs alone in the apparatus in order to ascertain 
information as to a position preference. On the 
basis of the choices made, each S was classified 
as a left, right, or neutral animal. The maze 
was in the same condition as day 1 and the Ss 
were satiated. 

Satiation routines.—The following feeding 
and drinking schedule was begun two days prior 
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to the first day of preliminary training, and was 
used throughout the training trials. About nine 
hours prior to the time of running, which began 
between 4:30 to 5:00 p.m. each day, the water 
bottles were taken from the living cages. Ap- 
proximately three hours before the time of run- 
ning, the water bottles were placed back on the 
cages and an additional water dish was put into 
the cage. An abundance of food was also placed 
in the cage. About one hour before running 
time an extra dish of small pellets (Purina Dog 
Chow) was put in the cage and the water dishes 
refilled. Following completion of the training 
runs, water was available until the next morning, 
but no food was given until the following day. 

The satiation procedure during the two test 
trials following the completion of training was as 
follows. Prior to being tested under hunger 
motivation, food was withheld for 23 hours. 
The water bottle was available at all times in the 
cage and the procedure of placing an additional 
pan of water in the living cage before the daily 
run was used. In the case of thirst motivation, 
the water bottle were removed approximately 
nine hours prior to the test run. Food was avail- 
able in the cage at all times and small! pellets 
were placed in the living cage before the test 
trial. 

Learning series.—The Ss were given four trials 
daily for seven days. In the first and third trial 
of each day the Ss were free to choose either 
alley. On the second and fourth trial they were 
forced to take the alley not chosen on the previ- 
ous trial. An equal number of experiences in 
each alley was thus provided all subjects. 

The animals were run in groups varying in 
size from six to ten Ss. The same order of run- 
ning was followed on successive trials of a day 
but differed from day to day. All Ss of a group 
were run before the next trial was given, a pro- 
cedure which provided for a minimum interval 
of at least 10 min. between the successive trials 
for each S. 

As was described in the section on the state- 
ment of the problem, the motivation for running 
the maze was the ‘need’ to get to the social cage 
containing cage mates. The Ss were taken to 
the experimental room in a carrying cage which 
consisted of a number of small individual com- 
partments (4 in. by 10 in.). S was taken out of 
this carrying cage and placed directly in the 
starting box of the maze. At the completion of 
the trial it was placed in the social cage. In the 
case of trials two, three and four, each S was 
taken out of the social cage three runs before its 
turn to run and placed in one of the narrow com- 
partments of the carrying cage. After the last 
trial of the day, a five-min. period elapsed before 
the group as a whole was returned to its living 
cages, 
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Upon entering an end box S was permitted to 
explore until E was sure that it had examined the 
food or the water orifice. As the food was 
spread out over the floor of the goal box, the Ss 
were forced to walk over it and there was little 
difficulty in determining whether they sniffed at 
it. In the case of the water spout, however, S 
was not so likely to come upon it in the course of 
its exploration. If at the end of a 15-sec. period 
the water spout had not been approached, the S 
was picked up and placed directly before the 
spout with its nose just a fraction of an inch from 
it. Typically the Ss moved around the goal box 
sniffing at either the food or water spout within 
the 15-sec. period. Care was taken to note 
whether the animal ate any of the food or drank 
water. No Ss included in the 39 cases were ever 
observed to eat or drink. 

Test series.—Test trials were given on the two 
days following the training period. On these 
trials the Ss were motivated on one day for one 
of the goal objects and satiated for the other. 
On the following day the motivation was re- 
versed. If the S chose correctly (i.e., chose the 
alley which led to the goal object for which 
motivated) on the first trial of the day, it was 
permitted, depending on its motivation, either 
to drink for five to ten sec. or to eat for the same 
period. It was then placed in the social cage 
and after a 10-min. period returned to the home 
cage. If the first test trial was incorrect, the S 
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was given further trials until the correct alley 
was taken. The same procedure was followed 
on the second test given on the following day. 
In the first four groups of subjects run, the 
decision as to which motivation was to be pres- 
ent in a particular S on the first trial was deter- 
mined by the preference for the two alleys shown 
in the fourteen free choices of the training series. 
In all instances, if an S had chosen one of the 
alleys less than 50 percent (six times or less), it 
was motivated for the goal object in that alley. 
In the last group of ten subjects the reverse 
procedure was employed. In this group the Ss 
were motivated for the goal object to be found in 
the alley for which a preference had been shown 
—i.e., eight or more choices of it made. In the 
case of five Ss which chose each alley 50 percent 
of the time on the training trials, three were 
arbitrarily made thirsty and two hungry. 


REsuLtTs 


Fig. 1 presents the results for all 39 
Ss in terms of the percentage of re- 
sponses made on the first trial of each 
training day and the two test days to 
the side containing the goal object ap- 
propriate for the motivation of the first 
test. ‘These values tend to run below 





PER CENT TO 1ST TEST GOAL SIDE 





ethhenns 





a 


+ 


DAYS 


Fic. 1. Percentages of Ss that chose the side appropriate on 
during seven training days and two test days 
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the 50 percent line for the training 
days because, as was indicated previ- 
ously, the majority of Ss were moti- 
vated on test one for the goal object 
on their non-preferred side. 

The marked shift in the curve at the 
first test day suggests that the sub- 
jects had learned something about the 
location of the two goal objects during 
this training period and when moti- 
vated for one of them, tended to take 
the alley leading to it. Twenty-four 
(61.5 percent) of the Ss chose the 
alley leading to the appropriate goal 
on this first test, as compared with 
only 33.3 percent choice of this alley 
on the first trial of the last day of 
training. The difference of 28.5 per- 
cent between training day seven and 
the first test day is 2.35 times its 
standard error, a critical ratio which 
is significant beyond the one percent 
level of confidence.” 

As a further check, a more stable 
measure of the training preference 
than that provided by the first trial 
of the last training day was sought by 
ascertaining which arm was chosen 
two or more times on the first trial of 
the last three training days. Table I 
presents the comparison of this meas- 
ure (column 2) with the test perform- 
ance (column 3). The critical ratio, 
determined by the method for depend- 
ent samples, is shown in the last col- 
umn. The critical ratio (2.98) for all 
39 subjects is again significant at well 
beyond the one percent confidence 
level. 

Table I, it will be seen, presents a 
further analysis of the data in terms 
of the subgroups that were hungry or 
thirsty on the first test. It will be 
seen that the hungry subjects showed 
a significant (P <.01) shift from 


* The standard error of the difference between 
the percentages was computed by the formula 
for dependent samples as described by McNemar 
(16, p. 77). 
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their training performance on test 1, 
whereas the shift for the thirsty sub- 
group was not significant (P > .14). 
The poorer result for the latter group 
is not unlikely due to the fact that a 
thirst drive based on only nine hours 
of water deprivation was used in this 
experiment. This degree of thirst was 
selected on the basis of the obstruction 
box studies as being approximately 
equal in strength to a 24-hour hunger 
drive, but these results suggest that 
this may not be the case. 

Looking at the results for the second 
test, in which the motivation was 
shifted from hunger to thirst, or vice 
versa, it will be observed in Fig. 1 that 
the choice point behavior of the sub- 
jects shifted markedly again. Only 
30.8 percent of the Ss chose the alley 
leading to the goal object of test one, 
while 69.2 percent chose the alley 
leading to the appropriate goal for the 
second test. While this shift in 
choice of the two alleys was appropri- 
ate to the motivational change, it 
should be noted that the percentage 
of appropriate choices on this second 
test (69.2 percent) did not, as in the 
first test, differ significantly from the 
percentage of responses made to the 
same alley during the first trial of 
the last training day (66.7 percent). 

Table II presents the results of the 
second test in terms of the training 


TABLE I 


Percentace or Ss THat RESPONDED TO THE 
Stowe AppropriaTE FOR THE First Test on 
(a) tHe Last Taree Days or Tratnine, 
anpD (b) THe First Test Day 
(a) Riese | (b) 
Three | Test 


Training | Day 
| Dave % X % 








Critical 
Ratio 
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TABLE Il 


Percentace or Ss THat RESPONDED TO THE 
Siwe AppropriaTe FOR THE SEconND Test 
on (a) THe Last Turee Days or 
Trarninc anv (b) tHe Seconp 
Test Day 
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Motivational 
‘onditions 





69.2 
75-0 


Hungry (N =23) 65.3 -32 














performance measure based on the last 
three days of training. It will be ob- 
served that neither the group as a 
whole nor the two sub-groups shifted 
significantly from their training pre- 
ference to the correct alley. Indeed, 
the Ss that were thirsty went fewer 
times to the appropriate side of the 
test day than they had gone on the 
last three days of training. 


Discussion or RESULTS 


Since the completion of this experi- 
ment in 1940 (21), a number of studies 
(10, 15, 24) essentially similar in prin- 
ciple and design to it have been carried 
out in the Iowa laboratory and one 
almost identical study has been re- 
ported by Meehl and MacCorquodale 
(17). We shall first present a sum- 
mary of the findings of all these experi- 
ments and then discuss their implica- 
tion for the rival theoretical interpre- 
tations. 


The experiments of Kendler (10) and 
Meehl and MacCorquodale (17) gave 
almost identical results with those of the 
present study. Both obtained a sig- 
nificant number of appropriate responses 
on the first test, the results of the Meehl 
and MacCorquodale study being at the 
one percent level and that of Kendler at 
the five percent level of significance. 
Both studies also agreed with the present 
one in failing to obtain positive results 
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on the second test in which the motiva- 
tion of the first test was reversed. From 
the description of their experiment, the 
motivational-reward conditions of the 
Meehl and MacCorquodale investigation 
closely approximated those of the present 
study. The chief difference appears to 
have been that the Ss were put back in 
the home cage only after the second and 
fourth trials of the day. Whether they 
were placed in a cage containing other 
animals (social goal) after the first and 
third trials of the day is notclear. These 
experimenters explicitly recognize the 
operation of such reinforcements as “‘be- 
ing removed from the confinement of the 
maze, returning to the home cage, etc.” 
(17, p. 375). 

Kendler’s study, carried out in the 
lowa laboratory, deliberately attempted 
to avoid the social goal box and substi- 
tuted pushing of the S through the maze 
when it remained in one spot for more 
than five min. Presumably the motiva- 
tion for locomotion in this study was the 
mild, noxious one resulting from the 
pushing. By proceeding to either end 
box the § escaped such prodding. It was 
then picked up from the maze and placed 
back in the carrying cage to await the 
next trial. The poorer results of this 
study might possibly be ascribed to the 
slower running of the Ss and hence the 
longer time elapsing between entering an 
alley and the subsequent experiencing of 
the food or water. 

All three of these studies, then, gave 
positive evidence of the acquisition of 
sign-significate expectations with respect 
to the food and water when not motivated 
for these objects on the first test, but gave 
negative evidence on the second test. 
To complicate the picture even more, the 
two most recent studies (15, 24) in the 
Iowa laboratory have failed to give any 
favorable evidence whatever for such 
learning. Maltzman’s (1§) experiment 
attempted to improve the social reward 
conditions employed in the present ex- 
periment with a view to speeding up the 
running of the Ss and thus reducing the 
time between the occurrence of the cues 
at the entrance of an alley and the sub- 
sequent occurrence of the significates. 
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Other things equal, this presumably 
should provide better conditions for the 
acquisition of the essential cognitions. 
Maltzman did obtain such a speeding up 
of running at least over the present study 
and that of Kendler by placing the social 
cages, which contained a cage mate, at 
the end of each alley, one foot beyond the 
boxes containing the significates, food 
and water. The Ss ran directly by 
means of a connecting section into the 
social cage. The latter contained a one- 
way door that prevented the social ani- 
mal from entering the maze. Quite in 
contrast to the previous studies, how- 
ever, Maltzman found that his Ss did not 
respond appropriately to the side con- 
taining the significate for which they were 
motivated on the first test. Thus the 
correct side was chosen only by 17 out of 
40 Ss on the first test, a result obviously 
not significantly different from chance, 
which the Ss had been running during the 
last five training days. The second test, 
involving a reversal of the motivation of 
test one, also gave results that did not 
differ significantly from chance, although 


19 of the Ss did respond appropriately. 
Two subsequent tests on the following 
two days also gave chance results. 

So far as the motivational conditions 
are concerned, our most recent experi- 
ment (24) represents an extension of the 


Kendler study. Food and water were 
again available in one or another of the 
end boxes. Satiated for these objects, 
the Ss were motivated to run in the maze 
by anxiety. Prior to the learning in the 
maze, the Ss were trained to run down 
the stem of the maze, the floor of which 
was an electrified grid, to escape the 
shock in an empty end box. Following 
this preliminary training, 10 days of 
training (four trials per day) in the maze 
situation were given in which the Ss were 
required to escape by either arm. Shock 
was not given during this series, the ani- 
mals presumably being motivated to 
escape from the maze by fear or anxiety. 
The experimental question, as in all the 
above experiments, was whether the Ss 
would be able to acquire the appropriate 
cognitions as to the locus of food and 
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water, while motivated by anxiety and 
rewarded by escape from anxiety. 

Despite an even more thorough at- 
tempt to satiate the Ss for the food and 
water than had been necessary to pre- 
vent eating or drinking in our other in- 
vestigations, 14 of the 24 Ss in this ex- 
periment either ate or drank or did both. 
Considering the results of the remaining 
1o Ss that never drank or ate, six re- 
sponded appropriately on the first test 
and four on the second, a purely chance 
result. 

In marked contrast to this test per- 
formance was that of the Ss that ate or 
drank at one time or another in the ex- 
periment. Thirteen of these fourteen 
Ss responded appropriately on the first 
test and eleven on the second. The re- 
sults for the first test are significant be- 
yond the one percent level of confidence. 
The departure from chance expectation 
(s0 percent correct) for the two tests 
combined is also significant beyond the 
one per cent level of confidence. The 
success of these latter Ss is rather re- 
markable in the light of the fact that only 
about eight reinforcements involving 
drinking and eating were obtained on the 
average per S in the forty training trials. 


Such results as the above, combined 
with the findings of our earlier study 
(22) and its confirmation by Walker 
(28), Kendler (10), Grice (1), and 
Kendler and Mencher (11), which 
have shown that rats are unable to 
acquire either cognitions as to the lo- 
cus of food when not hungry but under 
a thirst drive or cognitions with regard 
to the location of water when not 
thirsty but motivated by hunger, have 
led Tolman to offer a modification of 
his views concerning the acquisition of 
sign-gestalt-expectations, or as he now 
prefers to designate them, field ex- 
pectancies (27). The extent of Tol- 
man’s reversal of viewpoint is shown 
by the following quotation: 


I used to be so impressed by the latent learn- 
ing experiments of the type invented by Blod- 
gett, in which no reward was introduced during 
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the learning periods, that I was apt to formulate 
the conditions involved in such field-expectancy 
learning primarily in terms of frequency alone 
and as if motivation played no role. However, 
if I did this, I was in error. It is obvious that 
completely unmotivated animals will not learn. 
They will go to sleep or otherwise divorce them- 
selves from the task. So it must be emphasized 
that in the Blodgett experiments, even though 
the animals were not rewarded, they were 
motivated. . . . Summing it up then, it appears 
that motivation conditions are very important 
for the building up of field expectancies (27, 
Pp. 150). 


While clear and emphatic about the 
necessity of motivation for learning 
field-expectations, Tolman is less de- 
cisive about the role of reinforcement. 
He denies the view: “that such learn- 
ing consists in the stamping in of S-R 
habits by reinforcements” (27, p. 151). 
But he goes on to state that: “The 
presence of reinforcement in a particu- 
lar locus makes that locus a goal which 
determines what performance will 
take place but it does not stamp in 
S-R connections, though it probably 
does give a special vividness to that 
locus in the total field expectancy” 
(27, p. 151). 

So far as the experimental studies 
discussed above are concerned, they 
point just as strongly to the necessity 
for reinforcement as they do for mo- 
tivation. In every single investiga- 
tion some form of reinforcement was 
present, and in one instance the ex- 
periment could not be conducted 
when no reinforcement was provided. 
However, none of these studies is able 
to answer the question as to whether 
learning can occur in the absence of 
reinforcement. As described earlier, 
they are concerned only with the ques- 
tion as to whether the learning of the 
locus of certain kinds of goal objects 
can occur under motivational-reward 
conditions not involving these ob- 
jects.* 


* There is no implication that these studies 
necessarily demand a reinforcement interpreta- 
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In terms of Tolman’s theory, these 
experiments attempt to answer the 
question as to whether the learning 
involves the formation of cognitions 
or field expectancies involving the 
stimulus events at the beginning of 
each alley and the stimulus events 
(food or water) at the end of each 
alley. The perception of these events 
in sequence is assumed to result in the 
knowledge (cognition) of the relation- 
ship between them. Given these 
cognitions, plus the knowledges that 
these objects satisfy their respective 
primary drives, the choice behavior of 
the subject will depend upon the rela- 
tive demand value (A) of the alterna- 
tive significates. Thus, if the S is 
hungry on the test and satiated for 
water, and assuming food is at the 
end of the left alley, then the strength 
of the tendency or force to the left 
response (B,) will be greater than that 
to the right (Bre). Employing the 
symbols used by Tolman in his article 
on secondary demands (26), this 
derivation may be represented as 
follows: 


Br> Brif Ar > Ar 
Ar > Arif Ar > Aw 


and if hypotheses Hye and Hriw 
exist, 
Ar > Aw if D, > D, 


and if hypotheses Hr.c, H,wc, exist. 

Where A represents secondary de- 
mands, such as demand for a particu- 
lar alley, for food, etc., H represents 
hypothesis, cognition, equivalence be- 
lief, etc., Dy, primary demand for 
consummation of food, i.e., hunger; 
D, primary demand for water, i.e., 


tion and are opposed to a non-reinforcement 
interpretation. While they are in line with a 
reinforcement assumption, they are not critical 
with respect to the issue, as none of the studies 
eliminated completely some form of reinforce- 
ment. 
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thirst, and C, and C, consummation of 
hunger and thirst respectively. 
Taking the results of all of the above 
experiments, the findings are not too 
favorable to Tolman’s S-S theory. 
In those instances in which the Ss 
were motivated for one of the signifi- 
cates, the test data have been uni- 
formly negative. In the experiments 
in which some motivation other than 
that for the significates was employed, 
the findings are ambiguous. In some 
(Spence, Bergmann, and Lippitt; 
Meehl and MacCorquodale, and 
Kendler) one of the tests has agreed 
with Tolman’s theory, whereas a 
second test has not supported it. In 
other studies (15,24) the test data 
have been uniformly opposed to the 
implications of Tolman’s theory. 
Attempting to fall back on the 
claim that the objective conditions of 
presentation of the significates in 
these experiments were not adequate 
for reception, such as Leeper (14) has 
done, is refuted by the positive in- 
stances, as the latter were precisely 
the same as those in which the results 
were negative (33). Furthermore, 
such conditions as Kendler and 
Mencher (11) arranged, preclude the 
possibility that their subjects did not 
‘see’ the food. It is at this point, of 
course, that Tolman introduces the 
factor of motivation. He writes, 
“Thirsty animals apparently do not 
notice food, even though the experi- 
ment be rigged as it was by Kendler 
and Mencher to seem to force them to 
notice that the cups which did not 
contain water did contain food” (27, 
p. 150). That is, under the motiva- 
tion of thirst, the animals did not, he 
claims, notice (perceive?) the food. 
But why, we may ask, did the Ss of 
our present experiment (accepting 
the positive results of the first test) ap- 
parently notice the food and water 
while motivated to get to the social 
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cage? And why, when the social goal 
box was brought nearer to the point of 
entrance to the alleys, as in Maltz- 
man’s study (15), did the rats appar- 
ently fail to notice the food and water? 
Again, what lies behind the failure of 
the Ss to notice the food or water in 
the experiment in which the Ss were 
motivated by anxiety and need to es- 
cape the anxiety arousing situation? 
Under just what motivational condi- 
tions will animals perceive or not per- 
ceive the presence of the significates? 
It is obvious that Tolman’s theory is 
as yet totally inadequte to account for 
these experimental results. Much ex- 
perimental work remains to be done 
before any satisfactory theory con- 
cerning the role of motivation in per- 
ception is achieved. Tolman himself 
is well aware of the primitive, un- 
developed nature of his theorizing, 
even though some of his more ardent 
followers are not. 


We turn now to the S-R interpretation 
of these phenomena. So far as we know, 
neither Hull nor any of his followers have 
attempted to formulate a complete the- 
oretical account of even so simple an 
instance of trial-and-error learning as is 
involved in the single T-maze. Many of 
the critics (3, 13, 18) of Hull have either 
never understood or have forgotten that 
in his Principles of behavior (6) Hull deals 
only with the basic principles revealed in 
classical and instrumental! conditioning. 
Hull’s basic principles do not include all 
of the principles that are important for 
all instances of learning, animal and 
human. By basic he means certain 
principles revealed in simple instances of 
learning from which he believes other, 
just as important, learning principles 
may be derived. Not treated in his 
Principles of behavior are such highly im- 
portant principles as that of the habit- 
family hierarchy (5), the principle of the 
fractional anticipatory goal response (4) 
and a number of others. 

The fractional anticipatory goal re- 
sponse is particularly important for all 
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instances of complex learning including 
the type of problem with which we are 
here concerned.‘ In the original report 
of the present experiment (21) and in an 
unpublished symposium paper presented 
at the 1941 meeting of the Midwestern 
Psychological Association (19) an at- 
tempt was made to show how this mech- 
anism would operate in the present type 
of experiment. Before presenting this, 
however, the necessity for including in 
the theory some other mechanism than 
the usual excitatory potentials between 
the environmental and internal drive 
cues and the responses may be shown by 
considering an experiment of Kendler (8). 
Using a single T-maze, one arm of which 
had food and the other water, Kendler 
had his subjects doth hungry and thirsty 
during the training series. In the test 
series only one of the drives, hunger or 
thirst, was operative. Under this test 
condition Kendler found that his Ss 
responded correctly a highly significant 
number of times. But it is obvious that 
under the training condition each of the 
responses was reinforced equally often 


and hence presumably was of approxi- 
mately equal strength at the completion 


of training. How then can one predict 
the differential strengths under the two 
test conditions? The explanation sug- 
gested by Spence (8,19) was that the 
stimulus cues in the water arm and end 
box become, during training, conditioned 
to fractional anticipatory drinking acts 
(rw) and, in turn, the proprioceptive cues 
(sw) resulting from these anticipatory 
acts become conditioned to the response 
of entering and continuing locomotion 
in this alley. In a like manner, condi- 
tioned, fractional anticipatory eating re- 
sponses (r;) develop in the food alley and 
these cues (s;) become conditioned to the 
behavior of entering this alley. During 
the test series when only one drive is 
operative, the anticipatory act related to 
the goal for which the subject is moti- 


*As one of us (20) has pointed out, the 
fractional anticipatory goal response is an in- 
stance of classical conditioning which itself 
enters into and plays a role in all the more com- 
plex types of learning, including instrumental 
conditioning. 
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vated will, because of the greater strength 
of the particular drive stimulus, Hull's 
principle of stimulus dynamism (7), be 
much stronger (more vigorous) than the 
other and hence will produce stronger 
proprioceptive cues. Thus, if the § is 
thirsty, proprioceptive cues from antici- 
patory drinking responses will be stronger 
than those from anticipatory eating. As 
these cues (s~) will be conditioned to the 
response of entering the alley leading to 
water, they will tend to give this response 
the greater excitatory strength (stimulus 
dynamism). When the animal is hun- 
gry, the proprioceptive cues (s;) would be 
stronger and this would give the response 
of entering the food alley the greater 
excitatory strength. In other words, the 
relative excitatory strengths of the com- 
peting responses will be a function of the 
relative strengths of the two fractional 
anticipatory goal responses, and the 
latter will be dependent upon which drive 
is present and which is absent. 

The application of this principle to the 
experiment of the present study is similar. 
In our experiment, however, the frac- 
tional anticipatory goal responses are 
much weaker as the Ss do not actually 
eat or drink during the training period. 
The sight of the food (or water) evokes 
conditioned fractional eating (drinking) 
responses. These, as described above, 
become conditioned to the cues of their 
respective alleys, the reinforcement being 
provided by the social goal, or possibly, 
as in higher order conditioning, by the 
sight of the food (or water) itself (i-e., 
secondary reinforcement). The much 
greater weakness of these fractional goal 
responses in this experiment as compared 
with the Kendler experiment, in which 
eating and drinking actively occurred 
should lead to much poorer conditioning, 
and hence, presumably smaller differ- 
ences between the excitatory tendencies 
of their proprioceptive cues. This would 
nicely account for the poor test results 
obtained in the present experiment as 
compared with Kendler’s motivated Ss. 

Making use of the same explanatory 
schema, Maltzman (15) has suggested 
that the explanation of the still poorer 
results he obtained was possibly due to 
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greater interference with the fractional 
anticipatory eating and drinking re- 
sponses by the stronger, anticipatory 
social goal responses. According to this 
view, the stronger the motivation opera- 
tive in the training situation and the 
stronger the correlated goal response, the 
poorer will be the test results, as these 
responses will interfere with the fractional 
eating and drinking responses. Another 
implication of this theoretical picture is 
that the more highly differentiated the 
fractional anticipatory goal responses 
elicited by the significates, the better 
will be the results on the tests. 

The failure of the Ss on the second test 
in the present investigation and in the 
similar study of Kendler (10) would be 
accounted for by the fact that all of the 
Ss of our study and those correct on the 
first test in the Kendler experiment were 
rewarded on the first test day. This 
relatively strong reinforcement under 
motivation for the significate or goal 
object would tend to strengthen the 
tendency to enter this alley, thus up- 
setting the balance of strength in favor of 
response to this side. As the second re- 
sponse was always to the opposite side 
there was thus introduced a factor that 
would be unfavorable to success on this 
test. 

This explanation does not hold, how- 
ever, in the case of Meehl and MacCor- 
quodale’s study, as the goal object for 
which the § was motivated on the first 
day’s test was not present. Hence, the 
response to this side would not receive 
this relatively strong reinforcement, but 
only the usual reinforcement provided by 
escape from the maze to the social cage. 
It may or may not be significant that the 
shift from the training performance to 
the appropriate test side was greatest in 
this study even though not significant. 

Finally, the completely negative test 
findings in the studies of Grice (1), 
Walker (28) and Kendler and Mencher 
(11), in which the Ss were thirsty during 
training and found water at the end of 
each pathway and food in one can be 
easily explained in terms of the concept of 
the fractional anticipatory goal response. 
In these experiments the drinking re- 
sponse would be dominant in both path- 
ways and hence would become condi- 
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tioned to the cues at each alley entrance. 
This explanation assumes, of course, that 
the drinking and eating responses are 
incompatible—a not unreasonable hy- 
pothesis. 


It is readily apparent that this S-R 
theory, like the rival sign-gestalt the- 
ory of Tolman, is much in need of more 
specific, detailed elaboration. Even 
in its present stage of development, 
however, it is more detailed and de- 
finitive than that offered by Tolman. 
It has, at least, suggested some of the 
variables, e.g., the various motivations, 
their relative strengths, the relations 
among the several goal responses, etc., 
that are important determiners of the 
behavior in the situation. Tolman’s 
original formulation recognized none 
of these variables and even in his most 
recent formulation he has only vaguely 
referred to the role of motivation. 
One may suspect that when the sign- 
gestalt theorists get around to making 
some definite assumptions, they will 
become more and more similar to 
those suggested above. 


SUMMARY 


The present study is one of a series 
of investigations undertaken in the 
Iowa laboratory to test Tolman’s sign- 
gestalt theory of simple trial-and- 


error learning. The Ss, 39 white 
rats, were given seven days of experi- 
ence (four trials per day) in a single 
T-maze in which one path led to water 
and one to food. The motivational- 
reward condition employed to get the 
animals to run in the maze during the 
training period was that of getting to 
a social cage containing a cage mate. 
In the first of two test series given 
following the completion of training, 
23 of the Ss were run under thirst while 
satiated for food and 16 were hungry 
and satiated for water. In a second 
test on the following day the motiva- 
tional conditions for each S were re- 
versed. 
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The results of the first test showed 
a significant shift from the pre-test 
choice of the Ss to the alley containing 
the goal object for which they were 
motivated. The results of the second 
test, on the other hand, were entirely 
negative, with little or no shift in the 
direction of the appropriate alley. 

The experimental findings are dis- 
cussed in relation to both the cognitive 
theory of Tolman and an S-R theory 
that makes use of the fractional anti- 
cipatory goal response. 


(Manuscript received October 20, 1949) 
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AN EXPERIMENTAL INVESTIGATION OF TRANSPOSITION 
AS A FUNCTION OF THE DIFFERENCE BETWEEN 
TRAINING AND TEST STIMULI! 

BY TRACY SEEDMAN KENDLER 
Psychological Laboratory, State University of lowa 


In the usual discrimination prob- 
lem, two or more members of a stim- 
ulus dimension, e.g., size or brightness, 
are presented to the subject. Re- 
sponse to one of them is consistently 
rewarded while response to the other 
either receives no reinforcement, or is 
punished. After a number of trials 
the subject learns to choose one of the 
stimuli consistently. The question of 
whether this learning is based on a 
tendency to respond to specific stim- 
ulus aspects or to the relational fea- 
tures of the stimulus complex, e.g., 
brighter or larger, has been of consid- 
erable theoretical importance. 

The Gestalt psychologists have nat- 
urally been interested in demonstrat- 
ing that the relational aspects of learn- 
ing are the most significant. They 
have emphasized the fact that experi- 
mental studies have _ consistently 
shown that after subjects have learned 
the discrimination problem, e.g., have 
learned to choose the larger of two 
stimuli and are presented with the 
originally positive stimulus and a still 
larger one, they now respond more 
often to the new and larger stimulus 
than to the one that was originally 
positive. Such apparent response to 
the relational aspect of the stimulus 
complex has been interpreted by the 

The present paper constitutes the major 
portion of a dissertation submitted in partial 
fulfillment of the requirements for the degree of 
doctor of philosophy in the Department of Psy- 
chology at the State University of lowa in 1943. 
The writer wishes to thank Professor Kenneth W. 
Spence for his invaluable aid in the direction of 
the investigation. Appreciation is also extended 
to Dr. David Ehrenfreund for his untiring co- 
operatio n in the construction of the apparatus. 


Gestalt psychologists as irreconcilable 
with associationistic and stimulus- 
response theories. 


In 1937, however, Spence (7) proposed 
a theoretical scheme based on stimulus- 
response principles to explain such ‘rela- 
tional’ responses. Utilizing the concept 
of generalization as revealed by condi- 
tioning experiments, plus a theoretical 
formulation of discrimination learning 
(6), he was able to explain such behavior 
without assuming that the response was 
to the relational aspects of the situation. 
His theory is based on the following five 
assumptions: 


1. As a result of successive reinforce- 
ments, the positive stimulus of a discrim- 
ination combination acquires an excita- 
tory tendency of a given amount to the 
response of approaching it, etc. 

2. There is a generalization of this 
tendency along the stimulus continuum 
and it takes the following form: E = 
Ma~*** (where X is the distance in 
logarithmic units between training and 
test stimuli). 

3. Failure of reinforcement results in 
the development of an inhibitory tend- 
ency of a given amount for the non-rein- 
forced stimulus. 

4. This inhibition generalizes along the 
stimulus dimension according to the same 
general formula as does the excitation. 

5. The effective excitatory strength of 
any stimulus member of the dimension 
is the algebraic summation of these ex- 
citatory and inhibitory tendencies. 


The implications of this theory can be 
most readily seen from an examination of 
Fig. 1, a diagrammatic representation 
taken from one of Spence’s later articles 


(8). The subject is assumed to have 
been trained to choose a stimulus square 
of 160 sq. cm. in preference to one of 100 
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39 62 100 


STIMULUS SIZE 


160 
a 


e SQ.CMS. 


Fic. 1. Diagrammatic representation of the relations between the hypothetical generalization curves, 
positive and negative, after training on the stimulus combination 160(+-) and 100(—) 


sq. cm. According to the theory, if 
stimulus 160 is presented with stimulus 
256, the effective excitatory strength of 
the new stimulus is greater than the 
original positive one and therefore the 
animal would be expected to respond to 
the new and larger stimulus. 

Although most infra-human organisms 
respond in a manner more or less con- 
sistent with the direction of the original 
training, the percentage of such responses 
to the transposed pair of stimuli is often 
fairly low and sometimes even in an op- 
posite direction. It is important for any 
theory of transposition to be able to ac- 
count for such variations. Spence’s 
theory would suggest that one of the sig- 
nificant variables determining the pro- 
portion of choices consistent with original 
training in such transposition tests is the 
distance of the test pair from the original 
training pair. Experiments by Gullik- 
sen (2), Kliiver (5), Spence (7) and Flory 
(1) have obtained evidence which give 
definite support to this general implica- 
tion. Thus when the amount of trans- 
position (percentage of responses con- 
sonant with original training) on test 
pairs near the original training pair is 
compared with the amount occurring on 
test pairs farther removed, the latter has 
almost invariably been smaller. 


None of the previous experimental 
studies, however, has provided suffi- 
cient data to permit a description of 
the relationship holding between the 
amount of transposition and distance 
along the stimulus dimension between 


the training and test pairs. The 
present investigation was aimed spe- 
cifically at a determination of the pre- 
cise nature of this law in the case of 
brightness discrimination with rats, 
and also a study of the implication of 
the results for a theory of transposi- 
tion. 


EXPERIMENTAL PROCEDURE 
Apparatus 


The apparatus used was a modification of one 
employed previously by Flory (1) and Spence 
(9). Its principal parts are shown in the di- 
agram in Fig. 2. A turntable, divided into two 
platforms, Art and A2, by means of partitions, 
stood about four feet above the floor and moved 
on a pivot made by two iron pipes. The smaller 
pipe, which was fastened to the table, fitted 
closely inside a larger one which rose from a base 
on the floor. A one-foot alley (B), two in. in 
width, connected the two open sides of the table 
and a vertically sliding door in the middle of this 
alley permitted E to control the subject’s passage 
from one platform of the apparatus to the other. 

The turntable was utilized in the following 
manner: At the beginning of each day’s trials, 
S was placed on the far platform of the apparatus 
(Az in diagram) and then the door was raised, 
permitting access to the platform adjoining the 
stimulus pathways. After making its choice the 
animal was encouraged to go back on to the turn- 
table. £ then rotated the apparatus so that the 
subject was again on the platform distant from 
the stimulus objects. Upon the appropriate 
adjustment of the stimulus objects, the next 
trial ensued. 

The second main portion of the apparatus, 
namely the stimulus objects (brightnesses) to be 
discriminated consisted of two glass pathways 
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(C1 and C2) 9} in. long and 4} in. wide. These 
pathways could be differentially illuminated by 
means of light boxes (H1 and Hz2) located im- 
mediately below them. These boxes were ar- 
ranged side by side on a two by four and were 
placed directly beneath the plywood board sup- 
porting the stimulus paths. Identical apertures 
(jt and Jz) 7§ X 44 in., were cut in both the 
plywood board and the top of the light boxes to 
permit the light to pass from the light sources up 
to the paths. The boxes could be rotated from 
one side to the other by means of a pulley system 
controlled by a wheel behind the screen, thus 
permitting the position of the stimuli to be 
changed from trial to trial. The light source for 








each path was an electric bulb at the bottom of 
each light box. 

Eight in. from the floor of the box, a plate of 
milk glass was inserted in order to diffuse the 
light. One in. above this glass was a small shelf 
designed to support shutters in which different 
sized openings were cut. After the light from the 
bulb passed through the lower milk glass and 
the circular opening, it filtered through a cut of 
ground glass that covered the aperture in the ply- 
wood board and finally reached the path which 
was itself constructed of milk glass. The glass 
and the empty spaces through which the light 
filtered served to diffuse the light evenly over 
the pathway and keep it from getting over- 






































Fic. 2. Isometric diagram showing the essential features of the apparatus 








TRANSPOSITION 


heated. All portions of the apparatus that were 
visible to the rat, except the stimulus paths, were 
painted a flat black. 

The brightness values of the stimulus paths 
were controlled and manipulated by: (1) the 
size of the circular openings, (2) the wattage of 
the bulbs, and (3) the voltage impressed upon 
them. Using these devices in the various com- 
binations indicated in Table I, eight brightness 
values ranging from .o11 to 204.7 ft. c. were ob- 
tained. Brightness is here defined as the meas- 
ure in apparent foot candles obtained with the 
use of the Macbeth Illuminometer. An attempt 
was made to have each successive stimulus value 
approximately five times brighter than its prede- 
cessor, thus providing for stimulus pairs with a 
constant ratio between the two brightnesses. 
As may be seen from Table II however, this ratio 
was not quite constant, but varied from 4.4 to 
5.92 


Stimulus conditions in learning and test series 


Two different conditions of original training 
were used. Under one (Condition I) the subjects 
(N = 58) were trained to choose the brighter 
member of stimulus pair No. o in Table II, ie., 
stimulus .063 in preference to stimulus .o1t. 
Following attainment of the criterion of learning 
each subject was given a series of 10 test trials 


TABLE I 


ComBtnaTion oF WATTAGE, VOLTAGE AND 
Area or Circutar Opinincs Usep 
ror Eacu Baricutness Stimutus 











Brightness in 
Apparent Foot! Wattage! Voltage 
Candles 


Area of Vircular 
Opening in 
Sq. Cm 
~ | — ——= ——— 
| 100 3-14 
100 (| 12.6 
100 22.9 
100 | 55-4 
100 | 358.7 
100 615.8 


130 318.5 
130 | No shutter 





*For these two stimulus brightnesses both 
pieces of filtering glass were removed. 


2 According to the photometric measures ob- 
tained just prior to running the first subjects, the 
ratios were actually five to one. When the 
measures were repeated toward the close of the 
experiment, the values deviated somewhat from 
the original measures. It is the second set of 
measures that appears in Tables I, II, and III. 
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TABLE Il 
Stimutus Parrs Usep 1x Conprrion I 








Brightness in Apparent 
Foot Candles 
Stimulus 
Pair 





Positive Negative 


ol 
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with one of the six stimulus combinations shown 
in Table II. The subjects were distributed 
among the various test groups in a random man- 
ner; all groups contained 10 subjects except the 
group which was tested on the stimulus com- 
bination employed in the original learning. 
There were cight subjects in this latter group. 

In a second condition (Condition II) the sub- 
jects were trained positively to the darker path- 
way of stimulus pair No.oin Table III™ After 
learning this discrimination, these subjects were 
tested with one of the darker stimulus combina- 
tions. To each of the four test groups six sub- 
jects were assigned at random. 

In the first condition, then, training was to 
the brighter of a pair of stimuli well down in the 
brightness scale and the extent of transposition 
to pairs of brighter stimuli of approximately the 
same ratio was measured, In the second condi- 
tion the subjects were trained initially to the 
darker of two relatively bright stimuli and trans- 
position tests were given on still darker stimulus 
pairs. We can speak of the transposition tests 
being in a positive direction, by which is meant 
that they are in the direction of the positive 
member of the original stimulus pair. We can 
also measure the extent of the distance from the 
original training pair in terms of the successive 
step intervals, the ratios of which are approxi- 
mately five to one. 


Training and test procedures 


Following a period of several days in which the 
subjects were fed their daily diet of 54 gm. of 
Purina Dog Chow at a regular hour and were ac- 
customed to handling, preliminary training on 
the apparatus was begun. On the first day the 
subject was placed on the apparatus and per- 

™* The brightest stimulus combination could 
not be employed as a training pair because the 
apparatus overheated when utilized in a long 
learning series. 
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TABLE Ill 
Stmavutus Pairs Usep mx Conprrion II 








Brightness in A nt 


Stiomushus Foot Cand 


if 




















mitted to explore it. On the two following days, 
10 small food pellets were placed in each of the 
food troughs at the end of both stimulus paths 
(D1 and D2) and the subjects were permitted to 
find and eat them. During these trials both 
pathways had the same brightness value, that 
of the positive stimulus of the original training 
pairs. Additional general illumination was 
provided on the first of these two days by a light 
in the box shown at the top of the turntable. 
The only illumination on the last day of prelimi- 
nary training and all subsequent training and test 
days was that provided by the stimulus paths. 
During the training and test series, the rats were 
kept in the dark room for 15 min. before they 
were placed on the apparatus to permit dark 
adaptation to occur. 

The learning series, consisting of 10 trials per 
day, was continued until the criterion of learning 
was attained. The criterion was 18 out of 20 
correct trials in both Conditions I and II, except 
that the further requirement that the last 10 
trials be correct was adhered to in Condition I. 
Because the discrimination probiem proved to 
be much more difficult in Condition II, the 
added requirement that all of the last 10 trials 
be correct was dropped. A correct response was 
rewarded by a small pellet of food (.2 gm.) while 
failure of such reinforcement constituted the 
sole consequence of responding incorrectly. The 
position of the positive pathway was varied ac- 
cording to the following schedule: RLLRRL- 
RLLR or the reverse of this order. 

In the test series each animal was tested on 
one stimulus combination only. Ten trials were 
given and responses to either stimulus pathway, 
darker or lighter, were rewarded regardless of 
the original learning conditions. 


Subjects 

In all, 82 hooded rats were used as subjects. 
Fifty-eight animals, half of which were males, 
were trained under Condition I; 24 subjects, 13 


males and 11 females, were trained under Condi- 
tion I], All animals were from the colony main- 
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tained by the Department of Psychology, State 
University of Iowa. They ranged in age from 
75 to 145 days at the beginning of training. 


REsuLTs 
Acquisition of discrimination habit 


The initial discrimination problem 
proved to be a difficult one to learn, 
especially in the case of the subjects 
trained under Condition II, i.e., to 
the darker of the stimulus pair. The 
latter required an average of 446 trials 
to reach their more lenient criterion 
while the subjects that learned under 
Condition I took, on the average, 252 
trials to learn. This difference in 
difficulty of the two learning situations 
suggests the possibility of a preference 
for the brighter stimulus.?. Another 
factor that possibly contributed to the 
difference is the fact that the ratio of 
the brightnesses was somewhat smaller 
in Condition II (4.8:1) than in Condi- 
tion I (5.7:1). 


Trans position test results 


It has generally been accepted that 
the best measure of transfer effects is 
provided by the first test trial, since 
all subsequent trials in the test series 
are likely to reflect the effects of the 


test trials. It is, however, infre- 
quently applied because of the large 
number of subjects required for reli- 
ability. Most studies apply a less 
stringent measure, such as the percent 
of ‘correct’ responses in a series of five, 
ten, or more test runs. In the present 
research three measures were planned 
and a fourth one emerged from the re- 
sults. The three planned measures 
consisted of the percent of responses 


*In an exploratory study preceding the pres- 
ent investigation, a slight but statistically sig- 
nificant (five percent level) preference for the 
brighter member of a stimulus pair was found. 
The subjects, 20 hooded rats, were each given 20 
trials which were rewarded regardless of which 
stimulus they chose. 
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PERCENT OF RESPONSES 
CONSISTENT WITH THE DIRECTION 
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PERCENT OF RESPONSES 
FIRST S TRIALS 92 6c 
ALL 10 TRIALS 95 63 
LAST 5 TRIALS 98 66 


72 
76 
60 


70 
6 
92 


54 
67 


60 78 


Fic. 3. The percent of responses consistent with the direction of training for subjects trained to 
the brighter stimulus and transposed up the brightness scale (Condition 1) 


consistent with the direction of train- 
ing in: (1) the initial test trial, (2) the 
first five test trials, (3) 10 test trials. 
The fourth measure was composed of 
the percent of transposed responses in 
the second five test trials.‘ 


The results on the initial test trial 
are presented in Table IV. Because 
of the small'N the results appear quite 
unstable. 

The percent of transposed responses 
for the first five test trials, the second 
five test trials and all 1o trials are 
presented in Figs. 3 and 4. 


*In Condition I the subjects were trained to 
the brighter stimulus. A response ‘consistent 
with the direction of training,’ or a ‘transposed 
response,’ during the test trials would be the 
choice of the brighter stimulus of the test pair 
presented. For Condition II subjects, who were 
trained to the darker stimulus, a response ‘con- 
sistent with the direction of training’ would be 
towards the darker stimulus of the test pair 
presented. 


An examination of the transposition 
results presented in Figs. 3 and 4 re- 
veals that, regardless of which measure 
is invoked or which experimental con- 
dition is described, there was a tend- 
ency for the subjects to respond in the 
direction consistent with the direction 
of training at every step tested. The 
percent of ‘consistent’ responses was, 
however, less at each of the transposed 
steps than it was at the zero (training) 


TABLE IV 


Percent or Susjyects Wuose Inrriat Test 
Responses Were ConsisTENT WITH THE 
Direction or TRAINING 








Steps Removed from 
Training Pair 
rs) | I | 2 | 
Condition I 88 | 80 7 
Condition II 67 











| 
J 
| 3 
| 
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Step. The hypothesis that the rela- 
tionship between the amount of trans- 
position and the distance from the 
training pair is suitably described by 
a straight line parallel to the X-axis 
may be rejected by simple inspection. 

It can also be seen that in both 
conditions the amount of transposition 
decreased as the distance between 
training and test pairs increased, up 
to a certain point; which point and 
what direction the curve takes after 
that minimum point is reached varies 
with the experimental condition de- 
scribed and the measure of transposi- 
tion involved. The difference between 
the results obtained under the two 
conditions can be dealt with most sim- 


PERCENT OF 
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plyzin terms of the amount of trans- 
position obtained on all 10 test trials. 
In Condition I the relationship be- 
tween the amount of transfer and the 
distance between training and test 
pairs resembles a U-curve, with the 
amount of transposition decreasing for 
three steps and then increasing again. 
In Condition II the amount of trans- 
position decreased consistently until 
the third step, where it leveled off at 
67 percent. Due to the limitations of 
the apparatus, there was no fifth step 
in Condition II, and consequently no 
way of knowing whether an additional 
step would continue the previous trend 
or provide a rise like the‘one found in 
Condition I. 





PERCENT OF RESPONSES 
CONSISTENT WITH THE DIRECTION 


OF TRAINING IN* 

FIRST S TRIALS 
ome 411 10 TRIALS 
LASTS TRIALS 





























PERCENT OF RESPONSES 
FIRST S TRIALS 77 63 
ALL 10 TRIALS 92 60 70 $7 67 
LASTS TRIALS 62 79 72 69 


Fic. 4. The percent of responses consistent with the direction of training for subjects trained to 
the darker stimulus and transposed down the brightness scale (Condition II). 
(The heavy dotted line represents an estimate of the subjects’ performance on Step o during the 


10 test trials. The value assigned to this step is the percent of correct runs made by the 24 subjects 
in the 20 criterion runs. This procedure was utilized because the following results were obtained in 
Condition I: Eight animals, in this latter condition, that were trained on Step o were actually pre- 
sented with the same pair of stimuli for 1o test trials. The percent of correct responses for these 
subjects on the 20 criterion trials was 94; the percent of correct responses on the 10 test trials was 95. 
Since there was such similarity between the results obtained in the criterion and test trials on Step o 


in Condition I, it appeared justifiable to estimate the corresponding value in Condition I as described 
above.) 
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Another salient result to be noted is 
the increase in the amount of transpo- 
sition as the test trials proceed. That 
is, when the percent of transposition 
obtained in the first five test trials is 
compared with the percent of transpo- 
sition in the second five trials, the 
latter is greater at every experimental 
step in both conditions. A difference 
in the same direction that is repeated 
on 10 independent points would occur 
by chance alone less than one time in a 
thousand. It therefore appears rea- 
sonable to accept the difference in the 
amount of transposition between the 
two measures (first and second five 
trials) as reliable. 


Discussion oF RESULTS 


One of the purposes of the present 
investigation was to ascertain the na- 
ture of the law relating the percent of 
responses in the transposition tests 
consistent with the original training, 


to the difference between the training 


and test stimulus pairs. Ordinarily 
the next step in an investigation of this 
sort is to fit a mathematical function 
to the empirical results which relates 
the dependent to the independent va- 
riable. ‘The experimental results have 
shown, however, that the nature of 
this relationship differs somewhat (es- 
pecially in Conditon I) depending on 
which measure of transposition is used. 
The first trial appears, in this re- 
search, to be too unstable a measure 
on which to base a functional relation- 
ship. Analysis of the results obtained 
on all ro test trials reveals that the 
last five test trials contain signifi- 
cantly more transposition than the 
first five trials. It is clear that in 
ascertaining the empirical function 
sought it becomes desirable to deter- 
mine which of the various measures 
most directly reflects the variables 
that the investigation was primarily 
concerned with. 


559 


It would ordinarily appear that the 
first five trials would be least influ- 
enced by extraneous factors; and that 
the rise in transposition in the later 
stages of the test series was due to the 
reinforcements received on the previ- 
ous test runs. Fortunately, this hy- 
pothesis could be tested in the follow- 
ing manner: The data were scanned 
for subjects that responded more fre- 
quently during the first five test trials 
in the direction that was not consistent 
with the direction of training. If the 
reinforcements that occurred during 
the test trials were a significant factor 
it would be expected that these ani- 
mals would show a decrease in trans- 
position in the second five trials since 
they had received more reinforcements 
for responding in the direction not 
consistent with the original training. 
There were three subjects, both con- 
ditions combined, that transposed only 
once in the first five trials. In the 
second five trials their mean transpo- 
sition was 3.0. Similarly, there were 
13 subjects that transposed only twice 
in the first five trials. Their mean 
transposition in the second five trials 
was 3.6. Apparently there was a rise 
in transposition for these 16 rats ia 
spite of receiving more reinforcements 
for responding in a direction not con- 
sistent with their original training. 

If the greater percent of transposi- 
tion in the second five trials is not at- 
tributable to learning during the test 
trials, a more satisfactory explanation 
of the discrepancy might lie in the fact 
that the amount of transposition in 
the first five trials was somehow de- 
pressed. There are several possible 
explanations for how this might have 
occurred. 


One explanation involves consideration 
of the light adaptation of the subjects. 
It is relevant to repeat here that the ex- 
periment was performed in a dark room, 
the stimulus paths pro ‘ding the only 











560 


illumination. The subjects were kept in 
the dark room for 15 min. before they 
were placed on the apparatus each day 
to permit dark adaptation to occur. 
There were no data on dark adaptation in 
rats, and therefore this time was a rough 
estimate. The consistent increase in 
transposition, found even at the zero 
(training) step, suggests the possibility 
that a certain amount of consequent ex- 
posure to the illumination level of the 
brightnesses to be discriminated was re- 
quired for the adaptation mechanisms of 
the eye, light or dark, as the case may be, 
to reach their maximum efficiency. If 
this be true, the initial trials would pro- 
duce less transposition than the later 
trials as a consequence of poorer dis- 
criminability. 

Another possible explanation would be 
that the test stimuli, since they had not 
been previously experienced, acted as 
external inhibitors for all responses, tend- 
ing more to chance reaction. As the 


test trials continued, the new test stimuli 
gradually lost their inhibiting power and 
therefore the amount of transposition 


increased. 

Since this study was not designed for 
this purpose, the data do not lend them- 
selves to a test of the validity of these 
alternative explanations (except for the 
elimination of the reinforcement hypoth- 
esis). Therefore, although the explana- 
tions proferred imply that the latter half 
of the test series provides a less contam- 
inated measure, there is no way to prove it. 
Moreover, since there are no reports in 
the literature of transposition of a similar 
phenomenon, it remains to be seen 
whether this result is an artifact of the 
present experimental procedure or a 
phenomenon common to all test series 
with new stimuli. 


Mathematical functions fitted to the 
transposition curves are not presented 
for two reasons: (1) it is not clear 
which of the measures is least influ- 
enced by extraneous factors in the test 
trials; (2) none of the measures lends 
itself to a simple, reasonable mathe- 
matical description for both experi- 
mental conditions. 
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We turn now to a second purpose of 
this investigation, the relation of the 
results obtained to the theory of trans- 
position proposed by Spence (7). The 
theory postulates that the differential 
tendency to choose one stimulus over 
the other is a function of the difference 
between the generalized effective exci- 
tatory strengths of the two stimuli. 
Attention should be called to the fact 
that Spence, due to the lack of suffi- 
cient data, has not attempted to make 
specific postulates about the nature of 
stimulus generalization curves under 
varying experimental conditions. He 
did make the general prediction, so far 
as the present variable is concerned, 
that the amount of transfer will de- 
crease as stimulus pairs farther re- 
moved from the test pair are employed 
in transposition tests. Actually even 
this general prediction is not independ- 
ent of the specific generalization curves 
involved. 

Spence has himself pointed out to 
the writer, in private communication, 
the following implications of his the- 
oretical schema (Fig. 1) when ex- 
amined in detail. In this figure the 
values appearing within the curves 
represent the amount of effective ex- 
citatory strength accruing.to each 
stimulus. The tendency to choose 
one stimulus in favor of the other is 
proportional to the size of the differ- 
ence in excitatory strengths between 
them. Beginning with the difference 
between the original training pair, we 
see that these values for the successive 
pairs of stimuli are as follows: 22.0, 
20.4, — 20.0, —33.6, —18, —14, 0, 0, 
etc. That is, according to this 
schema, the percent of responses con- 
sistent with the direction of the origi- 
nal training should at first decrease 
until they drop to well below the 50 
percent level (i.e., lead to a reversal 
of the originally learned preference) 
and subsequently rise and level off at 
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a response value, presumably 50 per- 
cent, where the difference between the 
theoretical effective excitatory 
strengths becomes zero. 

The empirical transposition curves 
obtained from the present experiment 
do decrease at first, but they do not 
drop either to or below the chance 
level. In Condition I the percent of 
transposition in 10 trials decreases for 
steps 1, 2, and 3 to 83, 76, and 67 per- 
cent, respectively. But at steps 4 and 
5 the curve rises to 75 and 81 percent, 
respectively. These results corre- 
spond with the expectations from 
Spence’s schema insofar as there is a 
decrease in transposition and a sub- 
sequent increase. The change in di- 
rection predicted from the schema 
should, however, take place below the 
chance level. The present results are 
all above chance. In Condition II, 
the curve representing the percent of 
transposition in 10 trials decreases for 
steps I, 2, and 3 as follows: 80, 70, and 
67 percent. At step 4 the transposi- 
tion is the same as at step 3, 67 per- 
cent. That is, there is no reversal and 
the curve appears to be levelling off at 
a value well above 50 percent. 

These data do not appear to corrob- 
orate all the expectations fror the set 
of theoretical generalization curves 
utilized by Spence to illustrate his the- 
ory. From this we may deduce either 
that the theory that transposition is a 
function of the difference between the 
generalized excitatory strengths is in- 
correct or, what is more likely, that the 
type of generalization curves depicted 
in Fig. 1 are not applicable to the pres- 
ent experimental situation.* A better 
test of Spence’s theory will be made 
when more is known about the nature 
of stimulus generalization. 


* There is another possible explanation of the 
present results. It would involve positing that 
there are other factors in the present training or 
test situations that act in addition to those 
posited by Spence to determine the final results. 
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It should be pointed out that if 
Spence’s theory should ultimately be 
proved inadequate, it would not mean 
that transposition cannot be explained in 
stimulus-response terms. There is no 
requirement in modern behaviorism that 
the stimulus be a purely elemental as- 
pect of the environment. In fact Hull 
takes pains to overcome “the misleading 
suggestion of singularity and simplicity 
otherwise likely to be conveyed by the § 
of the symbol s/»”’ (3, p. 206). He de- 
votes a chapter in this book to a discus- 
sion of the conditions under which the 
patterning of stimulus compounds takes 
place. 

It is possible to describe the present re- 
sults within the framework of the Hullian 
theory. The simplest description would 
be that there was some habit strength 
built up to respond to the pattern pre- 
sented by the training pair of brightnesses 
and that the transposition results ob- 
tained represent the generalization along 
the brightness dimension of this habit 
strength. It may well be, however, that 
the situation is considerably more com- 
plex, i.e., while the habit strength to the 
pattern is being built up there are, as 
Spence postulates, some excitatory and 
inhibitory strengths being built up to the 
individual training stimuli. A_ third 
factor, the tendency for responses to be 
greater to more intense stimuli, recently 
termed stimulus intensity dynamism by 
Hull (4), is probably active here. If this 
were the case, the results obtained would 
represent a combination of the general- 
ization from the above two types of learn- 
ing plus the stimulus intensity dynamism. 

A more precise explanation of the pres- 
ent results may have to wait until there 
are more data available about the nature 
of generalization of both individual stim- 
uli and stimulus compounds and, per- 
haps also, the way that these two types 
of generalizations interact. 


SUMMARY 


The purpose of this investigation 
was to determine the nature of the 
functional relationship existing be- 
tween the amount of transposition 
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and the distance between the pair of 
test stimuli and the original pair of 
training stimuli. 

Hooded rats (N = 82) were trained to 
choose one of two differentially illu- 
minated pathways to food. After the 
criterion of learning was reached, each 
subject was presented with one of a 
series of different pairs of illuminations 
at varying distances (steps) along the 
brightness dimension from the train- 
ing pair. They were given 10 test 
trials. 

In Condition I, the subjects (N= 58) 
learned to choose the brighter of two 
relatively dark pathways and were 
transposed up the brightness scale. 
The subjects in Condition II (N= 24) 
learned to choose the darker of two 
relatively bright pathways and were 
transposed down the brightness scale. 
The ratio between the brightnesses of 
the stimuli that composed the test and 
training pairs in both conditions was 
approximately § to I. 

The results showed that, in Condi- 
tion I, transposition decreased as a 
function of the difference between 
training and test stimuli for three 
steps. There was a reversal of the 
tendency for the fourth and fifth step, 
i.e., the amount of transposition in- 
creased. In Condition II the amount 
of transposition decreased as the dis- 
tance between the training and test 
stimuli increased until the third step, 
where it levelled off. An analysis of 
the test trials indicated that there was 
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more transposition in the last half of 
the test series than in the first half. 

The empirical results could not be 
readily fitted to a simple, reasonable 
mathematical function. The results 
are discussed in relation to Spence’s 
theory of transposition. 


(Manuscript received August 22, 1949) 
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THE EFFECT UPON GENERALIZED DRIVE STRENGTH 
OF EMOTIONALITY AS INFERRED FROM THE LEVEL 
OF CONSUMMATORY RESPONSE! 


BY ABRAM AMSEL 


Newcomb College, Tulane University 


AND 


IRVING MALTZMAN 
University of California at Los Angeles 


INTRODUCTION 


If one postulates, as Hull (4) does, 
supported by the data of Perin (8) and 
Williams (11), that excitatory poten- 
tial and hence response strength is 
some joint multiplicative function of 
the learning variable (strength of 
habit) and the motivational variable 
(drive strength), then it must follow 
that any increase in the strength of 
drive will increase the strength of 
ongoing responses. 


The definition of drive (D) concerns 


us most here. This is a construct rep- 
resenting the total generalized drive 
strength. It is thought of as a non- 
specific motivational variable, con- 
tributed to by all the separate need 
states which are present in an organ- 
ism atany moment. Any single need 
state can be strictly defined by the 
particular set of antecedent circum- 
stances which give rise to it. Need 
states, which may contribute drive 
strength to the total generalized drive 
condition, are classed as either rele- 
vant or irrelevant, a relevant need 
being one which is reduced by the re- 
sponse being observed in a particular 
experimental situation, an irrelevant 
need being one which is not so reduced. 
The formula suggested by Hull for 
estimating the total drive strength 


1 The experimental portion of the study here 
reported was carried out in the Psychological 
Laboratory of the State University of lowa. 


from the combination of relevant and 
irrelevant needs is: 
D D+D 
D+ Mp,’ 

where D represents the drive strength 
contributed by irrelevant needs, D the 
drive strength contributed by the rele- 
vant need, and Mp represents the max- 
imum motivational strength. Ac- 
cording to the above formula, any 
additional irrelevant need introduced 
in an experimental situation will add 
to the total drive strength unless the 
relevant need contributes, by itself, a 
drive strength equal to the maximum 
(Mp). 

On the basis of the foregoing it 
would be expected that, ordinarily, 
any incident or event, in an experi- 
mentally controlled situation, which 
introduces an additional need state 
not necessarily relevant to the response 
being observed, will increase the 
strength of that response. This has 
not, however, been found to be uni- 
versally true. Several studies have 
shown that the strength of a previ- 
ously learned response in a particular 
situation is diminished by the addi- 
tion of an emotionally derived need to 
the existing motivational complex.’ 


2 The distinction will be made here and sub- 
sequently (2) between appetitional needs, which 
characteristically result from deprivation of a 
needed substance, and emotionally derived needs 
which are related to noxious stimulation. Three 
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Estes and Skinner (3) set up a lever 
pressing response in a Skinner box with 
grid floor, using rats as subjects. Hun- 
ger was the relevant motivation. Sub- 
sequently the animals were shocked in 
the same situation immediately following 
the three-min. presentation of a tone. 
After several tone-shock combinations, it 
was observed that the presentation of the 
tone resulted in reduction of the rate of 
lever depression. In this instance the 
addition of anxiety to the motivational 
complex decreased the strength of a pre- 
viously learned response, though the rel- 
evant hunger motivation was still present. 

An experiment of Mowrer and Viek 
(7) is also pertinent to the present dis- 
cussion. They found that a response of 
eating in a particular situation was in- 
hibited during a period when the sub- 
jects were anticipating shock, and that 
greater inhibition was associated with 
what they took to be a more intense fear 
state. Here again, the presence of fear 


(anxiety) as an additional need state did 
not increase the strength of the response 
being observed, but rather decreased it. 


In contrast to the two studies cited and 
that of Kendler (5), which showed the 
weakening of a bar-pressing response 
with the addition of a strong irrelevant 
thirst drive to the relevant hunger drive, 
are experiments by Miller (6), Webb 
(10), and Amsel (1), which have shown 
how response strength may be increased 
through the introduction of additional 
irrelevant needs. Of these, Miller’s 
experiment affords the best comparison 
with those of Estes and Skinner and of 
Mowrer and Vick, since it also employed 
an emotionally derived need as the ir- 
relevant part of the motivational com- 
plex. In this study, rats were trained in 
a T-maze under relevant hunger motiva- 
emotionally de rived needs will be differentiated: 
pain (actually the need for pain reduction) which 
is contiguous with noxious stimulation; anxiety 
(or fear), which is the conditioned pain reaction; 
and emotionality (a term employed by Siegel and 
Stuckey, 9) which is a perseverative response of 
the organism following pain, thought of as having 
motivational properties. Pain and emotional- 
ity have the status of primary needs, since they 
are independent of learning; anxiety (fear) has 
the status of a secondary (learned) need. 
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tion. When they had learned the correct 
response to food, they were satiated and 
half of their number was shocked briefly 
on a grid separate from the maze, immedi- 
ately before running the maze. The 
animals which were shocked ran more 
quickly and with fewer errors. In this 
case, the irrelevant need of emotionality 
was facilitating to the previously learned 
response.’ 

Apparently the addition of an emo- 
tionally derived need to a motivational 
complex may increase or decrease the 
strength of a response learned prior to 
the presence of that need. The posi- 
tion to be taken here is that such 
seemingly conflicting results can be 
shown to depend upon the relationship 
of the drive stimulus (Sp), characteris- 
tic of the emotionally derived need, to 
the response in question. If the intro- 
duction of an emotionally derived need 
into an experimental situation pro- 
vides a drive stimulus which has been 
conditioned to a response which com- 
petes with the one being observed in 
that situation, then, although the gen- 
eralized drive level may actually be 
stronger, the frequency and magnitude 
of the original response might still be 
reduced, In other words, the intro- 
duction of the new need state, with its 
characteristic stimulus component, 
may be thought of as changing the re- 
sponse hierarchy in the given experi- 
mental situation. Such an analysis 
seems particularly relevant to the ex- 
periment of Estes and Skinner. A 
need to reduce anxiety, conditioned to 
the tonal stimulus, would be aroused 
whenever that tone was sounded in the 
Skinner box situation. The total mo- 
tivational level might now be pre- 
sumed to have increased by the addi- 
tion of anxiety motivation to the 
dominant hunger motivation. How- 
ever, other responses conditioned to 


* The experiment was repeated with shock 
administered for one sec. at the starting point of 
the maze and the results were the same. 
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the anxiety stimulation (crouching, 
escape responses, anticipatory re- 
sponses to shock) might now success- 
fully compete with the previously 
dominant lever-pressing response. It 
might appear, then, as if the addition 
of a need state to a motivational com- 
plex reduced the level of motivation. 

If, on the other hand, the introduc- 
tion of an emotionally derived need 
does not provide stimuli which give 
rise to responses which compete with 
the previously dominant one, then 
that response should be observed to in- 
crease in strength when the new need 
is introduced. This would seem to fit 
Miller’s experiment. In that instance 
only one shock, of one sec. duration, 
was administered away from the maze 
choice point. It could not then be ex- 
pected that the subsequent stimula- 
tion from emotionality would be con- 
ditioned to responses in the maze 
which could compete with the domi- 
nant maze-running tendency. Here, 
then, emotionality would serve only to 
raise the general motivational level, 
and hence the maze-running perform- 
ance. 

If the above analysis of the effect of 
emotionally derived needs upon previ- 
ously learned instrumental responses 
carries over to consummatory re- 
sponses, the following experimental 
prediction can be made: that animals 
trained to drink in specific drinking 
cages for a given time each day, and 
trained to apparent maximum per- 
formance under such conditions, will 
show an increased rate of drinking 
when the drinking period is immedi- 
ately preceded by noxious stimulation 
administered in a situation other than 
the one in which the drinking occurs.‘ 


An experiment by Siegel and Stuckey (9), 
reported after the present data had been col- 
lected, is very similar to the present in its experi- 
mental i‘aspects. The hypothetical basis and 
interpretation are quite different. These differ- 
ences will be treated_in the discussion section. 
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Another experiment to be reported (2), 
tests the prediction that, under the 
drinking conditions outlined, the rate 
of drinking will decrease when the 
introduced emotionally derived need 
(anxiety) has been conditioned in the 
drinking situation. 


EXPERIMENTAL PROCEDURE 


Subjects and apparatus.—The subjects for 
this experiment were 20 female hooded rats from 
the colony maintained by the Psychology De- 
partment, State University of lowa. ‘Their 
ages ranged from 120 to 150 days at the start of 
the experiment. 

The apparatus employed consisted of (1) a 
battery of individual cages in which the Ss were 
confined for 10 min. each day for the measure- 
ment of water intake, and (2) a direct-current 
source connected to a grid for the delivery of 
electric shock. 

The drinking cages were arranged in a row. 
The floor, top, and front and rear walls of the 
cages were of hardware cloth; the sidewall parti- 
tions were made of plywood painted dark grey to 
eliminate the possibility of any facilitating or 
interfering effect of the drinking of one S upon 
that of another as a result of visual stimulation. 
The inside dimensions of an individual cage 
were 8 X 4 X 6 in. 

Shock was administered in a compartment of 
dimensions 74 X 4 X 34 in., which was fitted 
with a grid floor and plexiglass top. The walls 
of the shock compartment were of three-quarter 
in. pine, painted white. The source of electric 
current was a 6c6 pentode vacuum tube provided 
with a plate voltage of 650 volts. An intensity 
of 50 microamperes was employed throughout as 
the noxious stimulus. The nature of the shock 
circuit was such that variations in bodily resist- 
ance of the animals produced no measurable 
change in current. 

Habituation period—During this 10-day 
period the Ss were handled 10 min. per day by 
the experimenters. A drive rhythm based upon 
22 hr. of food deprivation and 12 hr. of water 
deprivation was established. 

Control period.—This was a 14-day period 
following habituation training during which daily 
measurements were taken of the amount of water 
intake for each S during a 10-min. period. 
Drinking performance was observed in the 
morning following 12 hours of water deprivation. 
Forty-five min. prior to drinking, each S was 
given four gm. of Purina dog chow in an individ- 
ual feeding cage. Water was not available at 
this feeding, so that a further increase in thirst 
presumably occurred. Approximately one-half 
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hour after the cessation of eating, the Ss were 
put into the drinking cages. ‘The water bottles 
were then weighed and inserted into the cages. 
After a period of 10 min., the water bottles were 
removed and re-weighed. The Ss were then 
fed the remainder of their 10-gm. diet and re- 
turned to the home cages. When 14 days of this 
kind of training had been carried out, the group 
drinking curve appeared to have reached a rela- 
tively stable asymptote. The experimental 
manipulation wac, therefore, introduced at this 
point. 

Experimental period.—During this three-day 
period the drive schedule was continued as 
before; however, a series of shock trials was given 
on each day between the prefeeding procedure 
and the measurement of water intake. Each 
S, at the completion of the four-gm. ration, was 
given four shock experiences. Each shock trial 
was of 20 sec. duration at an intensity of 50 
microamperes. The interval between shock 
trials was approximately 10 min. On each day 
of the experimental period, the Ss were placed 
in the individual drinking cages directly follow- 
ing the last shock experience and the water 
intake was measured as previously described. 
The remainder of the procedure remained 
identical with that during the control period. 
This procedure was followed for three days. 
Temperature readings were made on every day 
during the control and experimental periods. 
The small day to day temperature variations 
were found to be not significantly related to the 
average water intake over the relatively stable 
portion of the drinking curve. 

Switched period.—-This period followed im- 
mediately the experimental period and consisted 
of two additional days of drinking observation. 
On the first day of the switched condition the 
odd-numbered Ss were shocked prior to drinking, 
as during the experimental period; the .even- 
numbered Ss were not shocked, as during the 
control period. On the second day of this period 
the procedure was reversed: the even-numbered 
Ss were shocked while the odd-numbered Ss were 
not. This was designed to serve as an addi- 
tional check on the effect of shock-produced 


emotionality on drinking. 


Resutts anp Discussion 


Comparison of control and experimental 


periods 


The major findings are presented 


graphically in Fig. 1. This shows the 
mean water intake in grams of 20 
subjects during the 14-day control 
period and the three-day experimental 
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period. The curve of drinking, which 
had reached a relatively stable limit 
at about 5.7 gm. shows a sharp in- 
crease during the experimental period 
to a value of over seven gm. The rise 
of the control period drinking curve 
over the first nine days is probably due 
to the fact that on Day 1, the drinking 
situation was new to all Ss. It prob- 
ably mirrors the effects of two general 
factors: (1) adaptation of exploratory 
behavior which would compete with 
the drinking response, and (2) condi- 
tioning of drinking to the stimuli in 
the new drinking situation. 

In order to make a statistical test 
of the difference shown between the 
control and the experimental periods 
it was necessary to have a single score 
representing the drinking level of each 
S at the end of the control period and 
another single score for each S during 
the experimental period. To this 
end, a mean drinking score was com- 
puted for each S for the last three days 
of the control period, and another 
mean score was found for the three 
days of the experimental period. 
Table I shows these scores and the 
differences between pre-shock and 
post-shock drinking for each of the 
20 Ss. Examination of this table will 
reveal that in only two out of 20 cases 
was the pre-shock score higher than 
the post-shock score. The mean pre- 
shock drinking for 20 Ss was 5.78 gm.; 
the mean post-shock drinking was 
6.88 gm. An analysis of the differ- 
ence for related measures yielded a 
t-value of 3.42 (for 19 degrees of free- 
dom, the t-value at the one percent 
level of confidence is 2.86). A rea- 
sonably confident statement may, 
therefore, be made that the shock ad- 
ministered just prior to drinking re- 
sulted in a real increase in water in- 
take. This result is in line with 
the hypothesis as stated. 

An experiment recently reported by 
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Fic. 1. Mean water intake in gm. of 20 Ss on successive, daily, 10-min. drinking 
trials, under control and experimental conditions 


Siegel and Stuckey (9) tested certain phy- 
siological hypotheses regarding thirst 
and reported experimental findings which 
are essentially the same as the present. 
They concluded that emotionality in- 
duces a state of thirstinthe rat. In view 
of the fact that 27 out of 40 Ss in their 
experiment did not drink af a// over a 
two-hour period following faradic stimu- 
lation, such a conclusion may be seri- 
ously questioned. Of the Ss that did 
drink, in the Siegel and Stuckey experi- 
ment, those which had previously been 
shocked drank more than the control 
animals, which had not been shocked. 
This would seem to suggest, as has been 
hypothesized here, that emotionality in- 
creases not the thirst level but rather the 
general motivational level, which would, 
according to the introductory formula- 
tion, result in an increase in the strength 
of any concomitant response. Since, in 


the case of the present experiment, and 
that of Siegel and Stuckey, the drinking 
response was the aspect of the total re- 
sponse pattern which was measured, this 
response was found to have increased. 
One test of these somewhat conflicting 
interpretations would involve the meas- 
urement of food intake following faradic 
stimulation. If, as Siegel and Stuckey 
suggest, emotionality results in a direct 
inhibitory effect upon salivary flow with 
resulting dryness of the buccal cavity, 
and not to a general unspecific increase 
in the motivational level, then the in- 
gestion of dry food following faradic 
stimulation, where emotionality is not 
conditioned to competing responses, should 
be inhibited. According to the position 
taken here, emotionality would also serve 
to increase the rate and amount of food 
ingested under conditions similar to those 
of the present experiment. 
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Effect of shock and no-shock during 
switched period 


Each of the 20 Ss had one shock trial 
and one no-shock trial during the 
switched condition. The mean water 
consumption for the shock trials under 
the switched condition was 6.42 gm. 
For the no-shock trials, under this 
condition, the mean water intake was 
6.09 gm. Both the shock and no- 
shock mean values for the switched 
condition are larger than the pre- 
shock (control) mean, and smaller 
than the shock mean of the experi- 
mental period. A comparison of the 
shock with the no-shock trials of the 
switched condition, employing the 
t-test for related measures, yielded a 
t-value of 0.71. With 19 degrees of 
freedom the probability of occurrence 
of such a t by chance is more than 40 
in 100. The null hypothesis, with re- 
gard to the effect of shock during the 
switched condition, can, therefore, not 
be rejected. 

The failure of a significant differ- 
ence to be established between the 
shock and no-shock trials of the 
switched condition leads to some in- 
teresting conjecture regarding the con- 
ditionability of the response char- 
acteristics of emotionality to cues 
contiguous with them. If, during the 
experimental period, the perseverative 
emotionality from shock could be 
conditioned to the cues in the drinking 
situation, then, in the absence of 
shock before drinking, the drinking 
cage itself might elicit a conditioned 
form of the response of emotionality. 
If emotionality had previously been 
reduced by drinking, there might re- 
sult an increase in water intake during 
the no-shock trials of the switched con- 
dition over what it was during the con- 
trol period. Although the mean 
difference between pre-shock and no- 
shock (switched) was not found to be 
statistically significant (¢ = 1.05), the 
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TABLE I 
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Scores anp DirFERENCES FOR 
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difference (.32 gm.) was in the sug- 
gested direction. There is, then, some 
likelihood that the failure of the shock 
and no-shock trials of the switched 
condition to show a significant differ- 
ence in water intake may be attrib- 
uted to this factor of conditioned emo- 
tionality during the no-shock, switched 
condition trials. 

A more definite answer to the ques- 
tion, can emotionality (the persevera- 
tive emotional response from painful 
stimulation) become conditioned to 
contiguous stimuli?, must await fur- 
ther experimentation, wherein a 
greater number of trials are allowed 
for this conditioning to occur. In the 
present experiment, the emotionality 
response could only have been condi- 
tioned to the cues in the drinking 
situation during the three days of the 
experimental period. 


SUMMARY 


In order to test the hypothesis that 
the magnitude of a consummatory 
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(drinking) response would be _ in- 
creased when that response was pre- 
ceded by noxious stimulation admin- 
istered apart from the drinking situ- 
ation, 20 female albino rats were 
trained to drink in individual cages 
during a 10-min. period each day for 
14 days. When the average amount 
of water consumed had ceased to in- 
crease from day to day, and remained 
approximately constant for several 
days, a three-day experimental period 
was introduced. During these three 
days, electric shock stimulation im- 
mediately preceded the drinking 
period. After the experimental pe- 
riod, there was a further two-day pe- 
riod during which 10 of the 20 Ss were 
shocked before drinking on the first 
day, and the other 10 were shocked on 
the second day. This was termed the 
switched period. 

The introduction of shock before 
drinking in the experimental period 
resulted in a sharp and significant rise 
in the curve of average water con- 
sumption. On the basis of these re- 
sults, the hypothesis as stated could 
not be rejected. The results were 
interpreted in terms of an increase in 
the total generalized drive strength, 
and this interpretation was compared 
with an alternative interpretation of 
similar results by other experimenters. 

The data from the switched condi- 
tion did not show a statistically sig- 
nificant difference between the amount 
of water consumption with and with- 
out prior shock experience. These 
results were interpreted as indicating 
that emotionality (the perseverative 
aspects of the pain state) may become 
conditioned to stimuli which are con- 
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tiguous with that perseverative state 
although not contiguous with the 
shock experience itself. 


(Manuscript received September 2, 1949) 
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RESPONSE LATENCY AT ZERO DRIVE AFTER VARYING 
NUMBERS OF REINFORCEMENTS! 


BY DAVID ZEAMAN AND BETTY J. HOUSE 


University of Connecticut 


INTRODUCTION 


The present experiment was de- 
signed to test a specific theory, but 
the data may be relevant to any the- 
ory which attempts to account for be- 
havior in terms of previous learning 
and present motivation. 

The hypothesis to be tested is Corol- 
lary II (12, p. 247) of Hull’s Postulates 
5, 6, and 7 which states: 

Many experimental tests of this 
corollary can be made—at least as 
many as there are combinations of 
different measures of sEp and drive. 
Some of these combinations have been 
tried by other investigators with con- 
tradictory results. The particular 


combination used in this study has 


been chosen, not only because it was 
previously uninvestigated, but also 
because it obviates certain difficulties 
reported with others. 

A descriptive summary of the rele- 
vant experimentation cannot be given 
here. The literature is large (2, 4, 7, 
Q, II, 15, 17, 19, 21, 23, 24, 25, 26, 29) 
and much of it current. 


A critical summary of the experimental’ 


evidence leads to the conclusion that a 
number of difficulties are inherent in the 
type of research designed to discover the 
strength of apparently unmotivated be- 
havior. 

The first of these is that of finding a 
set of empirical operations which may be 
said by convention to create zero drive, 
when the drive is to be regarded as an 


! This experiment was performed in the psy- 
chological laboratory of Brown University. 

When primary drive strength (D) is zero, 
reaction-potential (sEg) has an appreciable but 
relatively low positive value which is a positive 
growth function of the number of reinforcements. 
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inferred state of the organism conceptual- 
ized as a theoretical construct ‘drive’ or 
D. That there is no adequate conven- 
tion as yet is attested to by the wide vari- 
ety of elaborate satiation procedures 
previously used to reduce the primary 
appetitive drives to zero. Zero level for 
some of the primary aversive drives such 
as the light aversion drive in the present 
research provide an easier problem, but 
there has been little experimentation to 
take advantage of this fact. 

The second difficulty is similar to the 
first. It is that of assigning to some em- 
pirical observation of behavior the status’ 
of anchor for zero amount of response 
strength which may be conceptualized, 
as in Hull’s system, reaction potential or 
sEr. If a theory asserts that there will 
be strength of behavior under zero drive 
and at the same time does not establish 
explicit operational definitions which will 
tie response strength and zero drive to 
some operations or observations, then no 
clear-cut relationship exists between 
theory and data. 

The difficulties are not insurmount- 
able. It is necessary only to try out 
various conventions until some are hit 
upon which bring about the happiest 
relation between theory and data. 
Many existing conventions relating oper- 
ations of satiation to the magnitude of 
resultant drive have not proven to be the 
best of choices. They do not make clear 
the grounds for deciding which of any 
two satiation methods produces the lower 
degree of drive. Why, for example, does 
interrupted access to wet mash produce 
lower hunger drive than continuous ac- 
cess todry pellets? Because one method 
gets more food into the animal? Or be- 
cause one produces lower resultant re- 
sponse strengths? Both or either of 
these grounds may be implied when ex- 
perimenters describe their satiation 
methods as ‘more rigorous’ or ‘more ex- 
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treme,’ but they are not explicated. Ex- 
plicit adoption of either of these grounds 
does not completely solve the problem. 
If we scale D by the amount of food a 
given procedure can get into the animal, 
what amount shall serve to indicate zero 
drive? The rigorous satiation proce- 
dures used by Koch and Daniel (17) and 
by Webb (26) might be offered as con- 
vention to be given the status of zero 
hunger drive. However, it may be pos- 
sible through other extreme methods, 
which might make additional use of social 
facilitation (10), specific appetites (28), 
or punishment of not-eating behavior, to 
get even more food into the animal. In 
these cases, what value of drive is to be 
assigned to these operations? It is rele- 
vant in the context of the present study 
on light drive to point out here that such 
a problem does not arise in conjunction 
with some aversive drives. If we iden- 
tify the strength of a drive with the 
strength of some external stimulus, we 
may offer as convention a zero amount of 
this stimulus as corresponding to zero 
drive strength. For example, with an 
aversive light drive, no quanta of light 
would correspond to no drive at all. It 
is possible that people can go on finding 
more and more ingenious ways of further 
food-satiating an animal, but it is un- 
likely that anyone will wish to go further 
than no quanta of light at all as a meas- 
ure of zero light drive. 

Returning now to a consideration of 
the second implicit ground for anchoring 
D mentioned above, that of scaling D by 
the ‘amount of response strength pro- 
duced by a given satiation procedure: if 
one feeding method is stated to be more 
rigorous than another because it de- 
presses behavior further, it is implied 
that zero drive would be associated with 
that method entirely eliminating be- 
havior. Such a convention would, of 
course, make the relation between zero 
drive and response strength (sER) a non- 
empirical matter. A definition of zero 
drive as one producing zero sEg would be 
logically inconsistent with Hull’s theory 
which postulates that zero drive will not 
entirely eliminate behavior strength, 
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hence it could not be used very well in 
an attempt to test Hull's theory. 

This does not mean that an aspect of 
behavior cannot be used to infer the 
strength of a drive produced by some ex- 
perimental operation. Circularity will 
be avoided simply by inferring zero drive 
from an aspect of behavior other than 
the one under investigation. The drive 
state bears many relations to behavior. 
The presence of it not only maintains 
behavior (the sHe X Drelationship) but 
the reduction of it can strengthen be- 
havior under appropriate (reinforcing) 
circumstances. One might offer as con- 
vention for zero drive that level of drive 
the reduction of which produces no 
measurable learning under otherwise 
optimal conditions. The experimental 
operation associated with such a level 
can then be carried over to already exist- 
ing, relevant habits of various strengths 
to find out whether zero level of drive so 
defined would reduce performance to 
zero level. 

A practical illustration of this method 
applicable to light drive may make the 
notion clearer. A rat is put into a cage 
wired in such a way that his running to 
one side of the cage turns out an over- 
head light. If the light is very bright, 
he will learn to prefer the side which 
brings about the dark. Different rats 
may be tried under various lesser degrees 
of illumination until one is found too dim 
to produce learning. In this way, a 
threshold of reinforcement by light drive 
reduction is found. Such an experiment 
has already been performed by Hanson 
(9), who found a reinforcement threshold 
between .135 and .os5 millilamberts. 
The rats spent equal time in the light 
and dark at this level of illumination. 
Now if this threshold level (or less) of 
light intensity is carried over into the 
present experiment after varying amounts 
of training have already taken place un- 
der strong drive, the question of the joint 
effect of zero drive (response defined) and 
varying habit strengths on performance 
may be checked. Similar techniques 
are conceivably employable with any 
primary drive, appetitive or aversive. 

Establishment of an adequate zero 
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drive convention was the first difficulty 
seen in the literature on the present prob- 
lem. Of equal importance is the problem 
of assigning the value of zero response 
strength or reaction potential to some 
empirical observations. In the litera- 
ture cited above, the following measures 
are some that have been interpreted as 
indices of zero strength: chance perform- 
ance on a maze, the level of strength pro- 
duced by extinction, criterion latencies, 
zero number of responses to an extinction 
criterion, a ‘negative’ number of extinc- 
tion responses, and the unconditioned 
level of strength. With so large a num- 
ber of ‘zero’ points, contradictory results 
are to be expected. In the interests of 
parsimony and consistency some one of 
these should be elevated to the status of 
accepted convention. The last of these 


conventions, the unconditioned strength 
—what Skinner calls the free operant 
level—has the most to recommend it. 
It is the most general convention in that 
it will yield a ‘zero’ level for many meas- 
ures of response strength such as prob- 
ability, latency, frequency, amplitude 


and so on; and it ties down ‘zero’ level of 
reaction potential to independent meas- 
ures of behavior. Estimates of the un- 
conditioned strength can be determined 
empirically for both the Pavlovian and 
selective type learning situations. 

‘The operant level criterion of zero re- 
action potential, as stated, is ambiguous 
in that two different methods of measure- 
ment are possible. As a first possibility, 
some single set of experimental conditions 
could be set down as the standard condi- 
tions under which operant level should be 
measured. For example, the untrained, 
unacclimatized animal could be put into 
the experimental situation under maxi- 
mum drive for a given period of time or 
number of sessions to allow sampling of 
his unconditioned ‘zero’ strength. One 
value of unconditioned strength for each 
response measure would be obtained by 
this method. The second possibility 
would require the measurement of oper- 
ant level under many conditions and 
would undoubtedly result in a shifting 
‘zero’ point. In a given experiment, the 
effects on operant level of any experi- 
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mental operations or conditions other 
than those involving reinforcement would 
have to be determined. Changes in the 
strength of relevant or irrelevant drives, 
for example, may operate to change un- 
conditioned response level. Also, con- 
tinued time or trials in the situation 
without reinforcement before learning 
may result in either an increment in un- 
conditioned strength, as Mote and Finger 
(18) have demonstrated for a running 
response, or in a decrement, as shown by 
Schoenfeld, Antonitis, and Bersh (22) for 
a lever-pressing response. The second 
convention of a shifting zero point is the 
more appropriate for a reinforcement 
theory on the grounds that it partials out 
of the total variation in performance 
changes in strength not attributable to 
the usual, independent theoretical vari- 
ables. Age, sex, and heredity are ex- 
amples of variables that as yet play no 
formal part in reinforcement theory. 
Their effects are usually partialled out 
statistically or experimentally. Changes 
in unconditioned strength belong in this 
category of as-yet-not-theoretical vari- 
ables and should therefore also be can- 
celled out. Adoption of the shifting 
zero point would provide a control for 
this factor. As an example illustrating 
the need for such a control, a recent ex- 
periment (26) may be cited in which vari- 
ous strengths of irrelevant drive served 
as the independent variable. The re- 
sulting increments in performance 
strength were attributed to the energiz- 
ing effects of the irrelevant drive on the 
strong habit structures that were present. 
Whether or not similar increments in 
performance strength would have oc- 
curred in the absence of habit structure 
was not determined. A necessary con- 
trol for this interpretation would have 
been the measurement of operant level 
under each of the irrelevant drive condi- 
tions and the determination of the differ- 
ence between the conditioned and uncon- 
ditioned levels. Use of this difference as 
a measure of reaction potential would, of 
course, amount to adoption of the shift- 
ing zero point convention. 

A difficulty in measuring operant level 
for a zero point may appear in those cases 
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where the conditioned response before 
learning is not sufficiently strong in the 
presence of the conditioned stimuli for 
any immediate aspect of it to be meas- 
ured. Under these circumstances, some 
investigators have measured the uncondi- 
tioned strength by the number of rein- 
forcements required to bring some aspect 
of the response past its threshold into the 
experiment. Measuring unconditioned 
level from the negative or below-thresh- 
old strength is not difficult in the Pav- 
lovian set-up because reinforcements 
can be counted by stimulus pairings be- 
fore the CR appears, but a problem 
arises in operant conditioning where the 
reinforcements are contingent upon the 
presence of the response from the outset. 
How can a _ negative unconditioned 
strength be computed here? Although 
the rat usually comes into the typical 
operant (lever-pressing) situation with a 
low measurable rate of response, some 
investigators arrive at a below-threshold 
strength in the following way. If the 


CR fails to occur within a given period 
of time after the presentation of the CS 


(the lever), the response is forced by 
baiting the manipulandum with food. 
The attempt to get at the bait brings 
about the response, which is then rein- 
forced with a food pellet and counted as 
an ‘unconditioned’ reaction (§). The 
number of such reinforcements required 
to bring the response within the set time 
limit measures the amount of below 
threshold strength and hence the dis- 
tance from no responses at all down to the 
‘true’ zero of strength. As an independ- 
ent empirical support for this measure of 
negative strength, Hull (12, p. 325, 8) 
offers the negative strength arrived at by 
extrapolation of certain learning curves 
to the region of zero number of reinforce- 
ments.? It might be recommended here 


*Full recognition is apparently not always 
given to the fact that Hull explicitly adopts this 
convention allowing zero sEg to be anchored to 
a negative response strength. When, for ex- 
ample, Koch and Daniel (17) found a median of 
one extinction response at ‘zero’ drive, they re- 
garded this strength as strikingly close to zero 
reaction potential and hence contrary to Hull’s 
theoretical prediction of a greater than zero 
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that if a time limit is used in a criterion 
of presence or absence of response in a 
measurement of operant level before 
learning, the same time limit be used in 
resistance-to-extinction criteria. This is 
not generally the case. Otherwise, this 
method can be considered as an adequate 
one of assessing the free operant level to 
conventionally represent zero reaction 
potential. 

In summary, the difficulties apparent 
in this area of research are primarily 
those associated with the absence of 
established conventions for zero drive 
and zero reaction potential. For such 
a problem there is no solution other than 
the continued use by investigators of 
explicit conventions which are general, 
consistent with theory, productive of un- 
equivocal results, and technically feasi- 
ble. 

The two conventions respected in the 
present research are a response-inferred 
zero point for drive and the uncondi- 
tioned level of strength as zero point for 
reaction potential. Explicating more 
fully, zero drive is that level of drive the 
reduction of which cannot produce learn- 
ing (increase in reaction potential) in an 
otherwise optimal reinforcement situ- 
ation; and zero reaction potential is the 
positive or negative strength of perform- 
ance measured before reinforcement has 
begun. 

It is the purpose of the present research 
to test the part of Hull's theory stated at 
the outset, using these two conventions. 


PRocepURE AND APPARATUS 


Subjects.—The subjects were 38 albino rats 
from the colony of the Brown laboratory, 21 
males and 17 females. Their ages ranged from 


reaction potential at zero drive. Their assump- 
tion was that no-extinction-responses-within- 
five-min. was equal to zero sEp, in contrast to 
Hull’s convention which would make zero ex- 
tinction responses equal to greater than zero 
sEg. Hence, although Koch and Daniel’s data 
prove that the Hullian equations overestimate 
response strength at ‘zero’ drive, they do not 
controvert the part of the theory that predicts 
greater than zero gEpz at zero drive. The 
reason, as has been pointed out, lies in this par- 
ticular convention relating zero sEp to behavior. 
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100 to 150 days at the beginning of the experi- 
ment. Half of the rats had been used previously 
in a panel-pushing experiment involving a food 
reward. Asa group, their performance was not 
distinguishable from that of the naive rats. The 
differences in sex and previous training were 
randomized among the experimental conditions. 

A pparatus.—Essentially, the apparatus con- 
sisted of a two-compartment box, with a swinging 
door which allowed the rat to go from one com- 
partment to the other. Between trials the 
swinging door could be enclosed by a double 
sliding panel. A light source was so arranged 
that either compartment could be darkened 
while the other remained illuminated, or both 
compartments could be made light or dark simul- 
taneously. 

The principal features of the apparatus are 
shown in Fig. 1. The lower part of the drawing 
shows the experimental box, with dimensions 
eight in. wide, two ft. long, and eight in. high. 
The partition across the center of the box divides 
it into two compartments 8 X 12 X 8 in. A 
doorway, two in. wide and three in. high, was 
placed in the middle of the partition. A thin 
piece of metal of the same dimensions as the 
doorway swung freely so that a rat could easily 
push his way from one compartment to the other. 
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A double wooden panel covered the swinging 
door between trials. This panel moved in two 
sets of grooves and could be raised or lowered by 
the experimenter by means of a string. When 
lowered, it enclosed the swinging door so that be- 
tween trials the rat in either compartment was 
presented with a smooth wooden panel over the 
door. The animals were placed in the apparatus 
and removed through the two doors in the ends 
of the box. The box was constructed of Mason- 
ite with a hardware-cloth floor. Set into the 
front of the box was a piece of glass 20 X 7 in. 
which allowed the experimenter to observe the 
rat’s behavior. The top of the box was covered 
by opal flash glass. The inside of the box was 
painted white throughout, except for the swing- 
ing door, which was painted black. The box 
was painted flat black on the outside. 

The upper part of Fig. 1 shows the wooden 
framework supporting the light source. Two 
300-watt bulbs with built-in reflectors were 
centered over the two compartments of the box. 
Below each of the bulbs was a Pyrex cake plate, 
8 X 11 X 24 in., filled with water to absorb the 
intense heat. Next in sequence was a sliding 
filter holder. For the present experiment, the 
filter holder contained a sheet of aluminum metal 
foil. In the figure, the sheet of metal paper 
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Fic. 1. The experimental apparatus described in the text. 
partment, he runs through the door into the dark compartment as soon as the panel is raised. 
black cloth and metal paper have been torn away to show the inside of the box. 
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When the light is on in the rat’s com- 
The 


Not shown are the 


two layers of light-diffusing glass directly on top of the compartments, the pulley arrangements for 
the filter holder and panel, and the electrical wiring. 
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covers the right compartment and cuts off the 
light leaving the compartment dark, while the 
left compartment is illuminated. The experi- 
menter, by pulling a string attached to the left 
end of the filter holder, could slide the sheet of 
metal paper over the left compartment, leaving 
the right side illuminated. Directly beneath the 
filter holder was a sheet of frosted glass. The 
frosted glass, together with the opal flash glass 
covering the compartments, acted to diffuse the 
light from the two bulbs. The white walls of 
the inside of the box provided an environment of 
relatively constant illumination. The bright- 
ness of a lighted compartment was 743 milli- 
lamberts, as measured by the Macbeth Illuminom- 
eter. A dark compartment, shielded by a 
layer of metal paper, had a brightness of .007 
millilamberts. The wooden framework contain- 
ing the light source was covered on three sides by 
aluminum metal foil painted flat black on the 
outside to radiate as much heat as possible. The 
front of the structure was draped with several 
layers of black-out cloth enabling easy access to 
the water pans for frequent re-filling. Enough 
of the bulb-heat was thus radiated off and ab- 
sorbed to keep the temperature inside the two 
compartments within one degree of that of the 
room over a period of five hours. 

Recording system.—The apparatus was wired 
for electrical recording of latencies. A micro- 
switch on each side of the box could be closed or 
opened by movement of the floor of the box 
which tilted slightly under pressure. ‘Thus, if a 
rat were on the right side, the right floor-switch 
was closed, and the left floor-switch was open. 
Either of these two switches could be placed in 
series with a Springfield timer by means of a 
single-pole double-throw switch. For example, 
if the rat were on the right side of the box, Ecould 
prime a circuit joining the timer with the right 
floor-switch by throwing the single-pole double- 
throw switch to the right. The circuit thus 
formed would be closed if the rat were on the 
right side of the box, and if the sliding panel were 
raised. Raising of the panel also gave the rat 
access to the swinging door and constituted the 
beginning of a trial. When the rat pushed open 
the swinging door and went into the next com- 
partment, the right floor-switch was opened, 
breaking the circuit and stopping the clock. 
Since the closing and opening of the circuit coin- 
cided with the beginning and end of a trial, the 
latency of response could be recorded from the 
timer. 

Learning—under drive.—Learning trials were 
conducted in the following manner. A rat was 
placed in a dark compartment. A few sec. were 
allowed for the experimenter to close the door of 
the compartment and to make sure that the 
switches were set correctly. When E was ready 
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to begin a trial, he pulled the filter holder, mov- 
ing the sheet of metal foil from one compartment 
to the other, thereby illuminating the compart- 
ment containing the rat. Two sec. later, E 
raised the sliding panel, exposing the black swing- 
ing door and starting the Springfield timer. If 
the rat pushed his way into the dark side of the 
box, the floor-switch was opened and the clock 
stopped. If the rat made no response within 
five min. after the raising of the panel, he was 
gently pushed through the swinging door and his 
latency recorded as indeterminate. In either 
case, E lowered the sliding panel and allowed the 
rat 60 sec. reinforcement in the dark compart- 
ment. During this period, he recorded the 
latency and re-set the clock to zero. After 60 
sec., the single-pole double-throw switch was 
thrown to the side containing the rat, and the 
filter holder was pulled to the other side so that 
the rat was now in the light. Two seconds 
later E raised the sliding panel to begin the next 
trial and to start the clock. 

The learning trials were given in daily sessions 
of 10 trials. Five experimental groups were 
brought to different levels of learning by varying 
the number of learning trials. Six rats were 
given 10 learning trials, 15 rats had 20 trials, 
eight rats had 40 trials, four rats had 70 trials, 
and five rats had 1oo trials. ‘The group contain- 
ing the 15 rats was run as part of another experi- 
ment requiring identical conditions. For the 
other groups in the present experiment, smaller 
numbers of rats were found to give as regular 
results. 

No drive—after learning.—On the day follow- 
ing the learning trials, each experimental group 
was given 10 test trials to determine strength of 
response under no drive. The same procedure 
was followed for test trials as for learning trials, 
with the obvious difference that during these 
trials both of the compartments were dark. 
Thus the 60-sec. period between trials did not 
serve as a primary reinforcement. Response 
strength under no drive for the five experimental 
groups was determined after 10, 20, 40, 70, and 
100 learning trials, respectively. 

No drive—before learning.—As a control, eight 
untrained rats were given 10 trials a day for 10 
days with both compartments dark. These 
trials were conducted with exactly the same 
procedure as the test trials described under the 
section labelled ‘No drive—after learning.’ In 
this way, there was determined the uncondi- 
tioned operant level of response under zero drive 
and the effect on this level of continued ‘elicita- 
tion’ without reinforcement. 

Drive-—before learning.—As an additional con- 
trol the unconditioned operant level under the 
intensity of drive used in the experiment was 
determined. Six of the same rats used in the 
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TABLE I 
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* For the values in this column, the number of the daily block of 10 trials refers to the number of 


previous blocks of learning trials under drive. 


** Criterion values representing ‘greater-than-five-min.’ 
*** This group of rats was run at an unscheduled hour on this day. The data were discarded. 


first control group were given 10 trials a day for 
seven days with both of the compartments 
lighted. ‘These trials were given two weeks after 
the last of the ‘No drive—before learning’ trials. 


RESULTS 


The major results of this investiga- 
tion are given in Table I and repro- 
duced graphically in Fig. 2. 

Learning—under drive.—The curve 
labelled ‘Learning—Under Drive’ rep- 
resents the course of latency change 
produced over 100 trials when a light 
intensity of 743 millilamberts pro- 
vided the drive and a reduction of this 
light to a level of .007 millilamberts 
provided the reinforcing situation (or 
reinforcement). The negatively ac- 
celerated decreasing shape of the curve 
is that usually found in learning curves 
when latency is the measure of re- 
sponse magnitude. The amount of 
change, one log unit, is relatively 
small, however, and may be attrib- 
uted to either the relative massing of 
trials or the weakness of the light 


drive? The major part of the latency 
change occurs within the first $0 trials. 
The last 40 latencies are relatively 
asymptotic. There is a considerable 
amount of variability in this function 
despite the fact that each point repre- 
sents the median of at least 50 values. 

No drive—after learning.—The 
curve labelled ‘No Drive—After 
Learning’ supplies a confirmation of 
the hypothesis which this investiga- 
tion sought to test, that under no 
drive, performance would be a positive 
growth function of the number of rein- 
forcements. If one assumes an in- 
verse relationship between latency 
and strength of performance, the con- 
firmation of the hypothesis is clearly 
suggested. 

It is important to note that for this 
function the abscissa refers to the 
number of previous blocks of learning 


* Experiments by Keller (15) and Hanson (9) 
found behavioral maxima at much lower intensi- 
ties of light drive. Their experiments will be 
discussed in a later section. 








LATENCY AT ZERO DRIVE 


trials under drive. The function was 
plotted in this way so that the vertical 
distance between it and the learning 
curve would represent the change in 
latency at each stage of learning pro- 
duced by the removal of drive. The 
log median latency at each of the five 
stages of learning is lower than that 
preceding it. There are no inversions 
of the points. The relatively smooth 
and progressive decrement of the five 
points is perhaps the strongest argu- 
ment that can be made for there being 
a true function. A Festinger d-test 
(6) shows that the two extreme groups 
are significantly different between the 
.o§ and .o1 level. The large and con- 
sistent difference between the ‘No 
Drive—After Learning’ curve and the 
*‘Learning—Under Drive’ curve makes 
statistical test of the difference super- 
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fluous. Inspection of the ‘No Drive 
—After Learning’ curve might be all 
that is required to show that Hull’s 
hypothesis would predict both the 
position and form of the curve drawn 
through the experimental points. 
The details of the relation between 
theory and data will be given in the 
Discussion. 

No drive——before learning.—-The sep- 
arate group of animals who ran from 
dark compartment to dark compart- 
ment had their latencies undergo 
change described by the curve labelled 
‘No Drive—Before Learning.’ There 
is essentially no systematic change in 
these latencies. The  best-fitting 
straight line as determined by the 
method of averages shows a slight 
negative slope. The slope constant of 
the equation was —.02, the intercept 





2.74 


ORI VE- BEFORE LEARNING 
Q fe} 








LOG MEDIAN LATENCY 





_— 


LEARNING UNDER DRivE 


NO ORIVE-BEFORE LEARNING 
e 











3 4 


b) 6 7 8 9 10 


DAILY BLOCK OF TEN TRIALS 


Fic, 2. A graph of the data of Table I. 
under drive and no drive. 


The two topmost functions represent the operant level 
The lowest set of points is a learning curve and the function on top of it 


shows the effect of removal of drive at varying stages of learning. With the exception of one, all 
points represent the median of at least 50 measurements. 
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2.23 log sec. The amount of decre- 
ment produced by this negative slope 
over the 100 is slight compared to that 
produced by learning. 

Drive—before learning.—Serving as 
another control group, those animals 
run from lighted compartment to 
lighted compartment yielded latencies 
which varied in a manner shown by 
the curve marked ‘Drive—Before 
Learning.’ Again, no _ systematic 
change is evident during 80 trials. A 
straight line fitted to these data shows 
a slightly rising slope and an ordinal 
position above that of the ‘No Drive— 
Before Learning’ curve. The effect 
of the light is to depress responding in 
this situation. The effect is signifi- 
cant as shown by the fact that all 
seven of the points representing oper- 
ant level under drive are higher than 
the corresponding points representing 
operant level under no drive. There 
are two reasons why this difference 
may be interpreted as a true difference 
due to the effect of the drive. The 
first rules out the possibility of group 
differences between the rats, since the 
same group was run under both con- 
ditions. The effect of being in the 
situation under no drive conditions is 
to decrease latency. Hence, if any, 
the effect of using the same group for 
both conditions would be to operate 
against the effect demonstrated. ‘The 
second reason considers outside evi- 
dence. There has been considerable 
experimentation which shows that 
activity in the rat decreases signifi- 
cantly in the presence of light (13, 14, 
20). It might be noted here that one 
of the usual defining conditions for the 
presence of animal drive is the increase 
in activity level under the presumed 
drive condition. Here we have evi- 
dence to the contrary. A stimulus 
condition has been clearly demon- 
strated to be a drive condition in that 
its reduction acts as a reinforcement. 
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This same drive condition has been 
shown to decrease activity when no 
reduction is brought about by the ani- 
mal’s responses. The decrease in 
activity, i.e., increase in latency, under 
drive is actually greater than that 
shown in Fig. 2, since three of the 
plotted points have the indeterminate 
value of ‘greater than five minutes.’ 
The straight line fitted to the points 
has little quantitative meaning other 
than a rough indication that the oper- 
ant level under no drive is consider- 
ably higher than under drive. 


Discussion or REsuULTs 


A direct confirmation of Hull’s 
Corollary II is to be seen in the 
curves of Fig. 2. Assuming the adop- 
tion of certain conventions relating 
theoretical and experimental terms, 
the corollary predicts the negatively 
accelerated form and intermediate 
position of the curve labelled ‘No 
Drive—After Learning.’ The con- 
ventions tying theory to data are 
these: 


1. That the median log latency is 
some simple inverse measure of re- 
action potential (sEx); 

2. That the light intensity is a di- 
rect measure of drive (D), and that the 
degree of darkness employed was equal 
to zero drive; and 

3. That the two measurements of 
unconditioned or free operant strength 
(labelled ‘Drive—Before Learning’ 
and ‘No Drive—Before Learning’) are 
estimates of zero reaction potential 
(sEr) under two different drive con- 
ditions. 


If these equivalences are accepted, 
one may translate the graph into 
verbal form closely approximating the 


corollary under test. That there is 
‘appreciable’ strength without pri- 
mary motivation is empirically estab- 
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lished by the large area between the 
‘No Drive’ curves before and after 
learning. The extent of this area 
brings into question that part of the 
corollary which states that without 
motivation sEz will have a low posi- 
tive value. The strength is equal to 
approximately 50 percent of the max- 
imum strength produced by learning 
under drive. Maximum strength, us- 
ing our convention of zero sEp, is the 
area between the operant level under 
drive and the asymptote of the learn- 
ing curve under drive. It should be 
recognized, however, that these area 
relationships are dependent upon the 
transformation used to convert raw 
latencies to reaction potential. Other 
transformations than the median log 
latency, such as reciprocal latency, log 
reciprocal latency, geometric mean, 
etc., radically alter the percentage of 
maximum obtained, and even the 
form of the relationships. Independ- 
ent of such transformations is one 
estimate of the magnitude of strength 
under no drive after learning. This 
would involve the comparison between 
the number of trials required under 
conditions of drive and no drive to 
produce the same level of strength. 
For example, after 100 reinforcements, 
removal of the drive reduces strength 
of performance to a level equal to 
that produced by 20 trials of learning 
under drive. This comparison would 
yield a percentage of 20 percent, 
which might more easily be inter- 
preted as a low value. These values 
of 50 and 20 percent give substance 
to that part of the corollary predicting 
a relatively low positive value under 
no drive. 

The verbal content of the corollary 
is completed with the statement that 
under nodrive “sEpz . . . is a positive 
growth function of the number of rein- 
forcements.”” The negatively accel- 
erated shape of the ‘No Drive—After 
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Learning’ curve may be derived from 
this statement, and hence supports it. 
Thus all parts of the corollary are seen 
to have some basis in fact. There re- 
mains the task of examining the ac- 
ceptability of the conventions adopted, 
the relation of the results to previous 
findings, and alternative theoretical 
explanations. 

The goodness of a set of internally 
consistent conventions is partially 
determined by the consequences of its 
adoption fora particulartheory. The 
use of the log median latency as a 
simple inverse measure of reaction po- 
tential is justified by the large and 
growing number of studies which have 
added to the validity of many of 
Hull’s postulates and corollaries by 
employment of this transformation. 
In the present study, not only was a 
particular corollary confirmed, but the 
negatively accelerated appearance of 
the log latency learning curve is con- 
sistent with Hull’s postulates con- 
cerning the growth of habit strength. 
These quantitative properties would 
be undisturbed if the distances be- 
tween the operant level and the post- 
learning performance curves were 
plotted, thereby converting the func- 
tions into direct rather than inverse 
measures of sEp. 


The advantage of a latency measure in 
the investigation of the present problem 
is derived partly from the disadvantages 
of the resistance to extinction measure 
most frequently employed in previous 
zero drive experiments. The measure- 
ment of the number of extinction re- 
sponses to a criterion requires a period of 
time during which changes in drive may 
take place. Kimble (16) and Horen- 
stein (11) have shown that drive increases 
very rapidly after food-satiation as indi- 
cated by changes in latency and amount 
of food intake. A further disadvantage 
of resistance to extinction measures is 
that a limited number of measurements 
can be obtained from each animal. In 
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the present experiment, ten latencies were 
obtained from each animal under no 
drive. These are practical advantages 
serving to recommend latency as a proper 
measure to be used in testing the zero 
drive corollary. 

The convention relating light intensity 
directly to drive (D) has precedence in 
the experiments of Keller (15) and Han- 
son (9). They found that progressive 
increases in light intensity had behavioral 
effects which would lead us to infer that 
D had undergone progressive increments. 
Keller reports that three different meas- 
ures of strength of a lever pressing re- 
sponse were related to the intensity of 
light used as a drive. Two measures of 
frequency of response increased with 
negative acceleration as the light intens- 
ity increased from 17 to 514 millilam- 
berts. Latency of response decreased 
with negative acceleration over the same 
range. The asymptotes of response 
strength occurred at approximately 340 
millilamberts, an intensity well below the 
743 millilamberts used in the present 
experiment. Hanson has found that in 
a two-choice light-dark situation rats 
averted a light intensity of 6.8 millilam- 
berts 93 percent of the time in the situ- 
ation. As the light intensity was gradu- 
ally decreased the rats’ light-dark choice 
approached the chance level. No prefer- 
ence between light and dark was found 
when the light intensity reached .os 
millilamberts. This value of light in- 
tensity approximates zero drive accord- 
ing to our previously stated convention 
of zero drive as that incapable of produc- 
inglearning. The level of darkness actu- 
ally used in the present experiment (.007 
millilamberts) was well below the rein- 
forcement threshold and approached a 
stimulus specification of absolute zero 
drive, defined as no light at all. The 
adoption of these operational definitions 
relating drive and light intensity allows 
the logical incorporation of the data of 
the light drive experiments within Hull’s 
theory and provides a method for a clear- 
cut test of the corollary under investiga- 
tion. 

Use of the light drive is perhaps more 
suited to this general problem area than 
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the more widely used appetitive drives 
for two reasons. Since the problem 
concerns primary drive and primary rein- 
forcement, the possibility of secondary 
reinforcement effects must be eliminated. 
The secondary reinforcing aspects of 
food and water rewards make this factor 
difficult of control. It is unlikely, how- 
ever, that the animal will come into an 
experimental situation with any second- 
ary reinforcing associations connected 
with the light drive. The difficulty of 
independent experimental control of 
hunger and thirst constitutes a second 
drawback of the appetitive drives. Dep- 
rivation of water may, for example, pre- 
vent complete satiation of food, but the 
light drive does not interact in a similar 
fashion with other drives. 

The last convention unifying theory 
and data is the shifting zero point con- 
vention. The consequences of its adop- 
tion were elaborated in the Introduction 
of this paper. 

Considering now, the relation of our 
results to those of previous studies, agree- 
ment is found with all those studies (2, 
4, 7, 15, 24, 25, 26, 29) which have re- 
ported the presence of performance 
strength under zero drive. Our results 
disagree with those studies (17, 23) which 
did not. For the specific problem of 
strength under zero drive after varying 
numbers of reinforcements, Siegel’s ex- 
periment (23) is the most closely related 
to the present study. Some of his re- 
sults can be interpreted as consistent 
with those of this experiment, and some 
as contradictory. The experiment most 
similar to the present one in its use of 
aversive light drive is that of Keller 
(15). He also found that removal of 
drive after learning left performance 
stronger than operant level. 

The majority of studies concerned with 
zero drive indicated a presence rather 
than an absence of behavior strength. 
For the most part, those reporting 
negative results have used appetitive 
drives, with extreme satiation procedures. 
These may involve an artifact. Provid- 
ing conditions for the animals to ‘over- 
stuff’ themselves could produce a nega- 
tive drive state leading to ‘resting’ re- 
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sponses as a means of reduction. One 
consequence of such an inferred drive 
state would be the interference with the 
truined response otherwise observed at 
zero hunger drive. The kind of evidence 
needed to further support the ‘stuffed- 
state’ inference would be a sharp decline 
in operant level at some point along the 
satiation continuum and/or a sudden 
interference with strong habits motivated 
by other drives at a similar point in food- 
satiation.‘ 


In interpreting the results of this 
study, it has been demonstrated that 
these confirm the verbal content of 
the corollary of Hull’s theory under 
test. It is also true that nothing in 
the results would negate the postu- 
lated mechanism from which the corol- 
lary wasderived. The strength under 
zero drive is postulated to reflect the 
sensitization of habit structure by ir- 
relevant drives. As a matter of fact, 
the present research is one of the few 
which shows that habit structure is 
necessary for the demonstration of 
apparently unmotivated behavior. 
Whether or not the habit structure is 
energized by irrelevant drives is not 
tested by our experiment, although 
this could most easily be done with the 
light drive technique, uncomplicated 
as it is by physiological interaction 
with other drives. 

Various other interpretations of the 
data could, of course, be made. The 
results are consistent with Anderson’s 
notion of an externalized drive (3), 
with Allport’s functional autonomy 
(1), and with Woodworth’s idea of a 
mechanism as a drive (27). Explan- 
ations of the results may possibly be 
formulated in terms of secondary rein- 
forcing or secondary drive stimuli, 
although such explanations would be 
restricted to response produced or 

4 The frequently heard bit of dinner table con- 
versation, “I’m too stuffed to move,” may be 


offered as a kind of anecdotal analogue of this 
inferred negative drive state. 
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internal stimuli only. This derives 
from the fact that the two compart- 
ments are nearly identical. In run- 
ning from dark compartment to dark 
compartment, the animal neither gains 
nor reduces any external stimuli. 
The inference of kinaesthetic stimula- 
tion acting as a secondary reinforce- 
ment of the responses producing it 
would serve to account for these re- 
sults and other like it. 

Although the data might be inter- 
preted as supporting any of the the- 
ories mentioned, the formulation of 
Hull appears to be the most explicit 
and the most general. 


SuMMARY AND CONCLUSIONS 


A light aversion experiment has 
been performed with white rats to 
determine the functional relationship 
between the strength of performance 
under zero drive and the number of 
previous reinforcements under maxi- 
mum drive. The strength of perform- 
ance under each drive condition was 
measured by the difference between 
the median log latency of response 
after training and the unconditioned, 
or free operant level before training. 
The response whose latency was 
measured may be molarly specified 
as a ‘cross-the-box-through-the-swing- 
ing-door’ response. Two zero drive 
conventions were considered, one a 
response-definition of zero drive as 
that level incapable of producing 
learning, the other a stimulus specifi- 
cation of zero drive as the total ab- 
sence of an aversive stimulus. The 
low level of illumination employed as 
zero drive met the first criterion and 
approached the second. The major 
findings of this study are enumerated 
below. 


1. When the primary light drive 
was reduced to zero, the strength of 
performance was found to be an in- 
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creasing, negatively accelerated func- 
tion of the number of previous rein- 
forcements under light drive. 

2. The performance strength under 
no drive was approximately S50 percent 
as high as that under high drive, when 
measured by the post- and pre-train- 
ing log latency differences, and ap- 
proximately 20 percent when meas- 
ured by the number of trials to reach 
a performance criterion. 

3. The lack of adequate conven- 
tions relating theory to data in this 
problem area has been noted and a 
justification presented for the conven- 
tions used in the present study. 

4. The results have been inter- 
preted as confirming the theoretical! 
hypothesis (Hull’s Corollary II) 
which was under test. 


(Manuscript received August 10, 1949) 
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THE EFFECT OF REMOTE ANCHORING POINTS 
UPON THE JUDGMENT OF LIFTED WEIGHTS * 


BY ROY K. HEINTZ 
University of Maryland 


INTRODUCTION 


One of the many problems which 
has arisen concerning the functioning 
of frames of reference is that of deter- 
mining the exact nature of the effects 
to be observed upon the introduction 
of remote anchoring points, i.e., stim- 
uli serving to delimit the basis for 
judgment, remote from those being 
judged in terms of units pertinent to 
the continuum involved. 

The principal impetus for studying 
this question derives from two experi- 
ments reported by Rogers (4). Using 


both inclined lines and lifted weights, he 
presented anchors of greater inclination 
and of greater weight, following a pro- 
cedure previously described by Volk- 


mann (7), and found that Os tended to 
judge the stimuli of his standard or de- 
pendently-judged series as less inclined, 
or as lighter, evidencing that the Os’ ab- 
solute or subjective scales shift toward 
the anchoring stimuli. Results led him 
to conclude that “up to a certain point, 
the anchoring effect as measured by the 
shift in category thresholds is a rectilinear 
function of the remoteness of the anchor- 
ing stimulus. At this point there is a 
break in the functions and a diversity of 
trends appears” (4, p. 40). 

Additional interest stems from Mc- 
Garvey’s results with verbal stimuli and 
verbal anchors. Reporting that results 
for two Os showed no evidence of dis- 
continuity, she concluded that “in the 
results of six subjects there was some 
evidence of discontinuity” (2, p. 78; 
emphasis supplied). 


* This article is based upon part of a disserta- 
tion submitted to the Faculty of Princeton Uni- 
versity in partial fulfillment of the requirements 
for the Ph.D. degree. The writer wishes to ex- 
press his appreciation to Professor Hadley Can- 
tril and Professor E. G. Wever for their guidance 
and direction. 


Hunt and Volkmann (1) have shown 
that anchoring effects can be obtained 
with affective stimuli. No evidence is 
afforded concerning alterations in scales 
as a function of the distances of anchors. 
More recently Postman and Miller have 
demonstrated that anchoring effects can 
be observed in temporal judgments. 
They do not mention the matter of dis- 
continuity, but report that “the flexibil- 
ity of the scale (i.e., O’s absolute or sub- 
jective scale) seems to be limited. .. . 
The introduction of more and more re- 
mote anchors yields only diminishing 
returns” (3, p. 49). 


PROCEDURE 


To secure further information concerning ef- 
fects of distant anchors a study was undertaken 
using the general procedure outlined by Rogers 
(4), with certain differences in detail as noted be- 
low. A series of weights was presented in keep- 
ing with the Method of Absolute Judgment (or 
Single Stimuli); after training, progressively 
heavier weights were introduced as anchoring 
points in the latter part of each session, in regu- 
lar ascending order, one only being used in each 
experimental session. 

The standard stimuli selected for presentation 
throughout the experiment weighed 84, 88, 92, 
g6, and 100 gm. Anchors were likewise sepa- 
rated by equal objective differences of four gm. 
They weighed 100, 104, 108, 112, 116, 120, 124, 
128, 132, 136, 140, 144, 148, 152, 156, 160, and 
164gm. The use of this small interval represents 
a material departure from the conditions of 
Rogers’ study, intended to form a more stringent 
test of the hypothesis and to furnish detailed in- 
formation concerning anchoring effects through- 
out a long range. 

Encased in standard fibre containers 2.5 cm. 
high X 6.5 cm. ia diameter, stimuli were pre- 
sented as rapidly as possible by means of a re- 
volving table, O’s vision being excluded by a 
large screen through which his wrist projected. 
Rest periods were allowed when requested. In- 
structions were similar to those given by Rogers: 


The stimuli you are to judge will be a series 
of different weights. You are to report their 
heaviness relative to each other, calling the 
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lightest weights ‘ONE,’ ‘TWO,’ and so on, up 
to ‘FIVE’ for the heaviest. Report your 
judgment on each weight, even though the first 
few judgments may be difficult for you to 
make. 


Three graduate students served as Os. With- 
out prior experience and naive as to objectives, 
each was given several sessions of practice in 
which anchors were not presented. F and W 
had three preliminary sittings each; A had five, 
his introduction being interrupted. Data drawn 
upon the first 250 judgments in each of these 
sessions are reported with that for the control 
series; the present study also differing from that 
of Rogers in the respect that a control series was 
run throughout the investigation to secure evi- 
dence of effects carried over from one session to 
another, and to enable measurement of the 
effects of anchoring within each day’s data. 

To this end, subsequent sessions were divided 
into two portions. In the first period, constitut- 
ing the control series, only stimuli of the standard 
series were presented, each being offered so 
times, in random order, for a total of 250 judg- 
ments. Immediately following, in the second 
portion of each sitting, anchors were introduced 
for the experimental series. 

Two procedures were employed in the experi- 
mental series. Under one, anchoring points 
were not called to 0’s attention, but where intro- 
duced randomly into the group of stimuli being 
presented, no interval of time or other cue dis- 
tinguishing the series. F and W both partici- 
pated under this condition, making 300 judg- 
ments in each experimental series, 50 in response 
to the anchor and 50 on each of the five standard 
stimuli, for a total of 550 judgments in each 
sitting. 

Under the second regimen, O was stopped 
momentarily following the control period, and 
anchors were identified for him during the experi- 
mental period by pairing them with the standard 
stimuli, the latter being randomized. This was 
the usage employed by Rogers. The special 
instruction given when O was arrested was simi- 
lar to that given his Os: 


The stimuli will now be presented to you in 
pairs instead of singly. The first member of 
each pair will tell you what you are to call a 
‘FIVE.’ The second member of each pair 
you are to judge as before, on a scale of five, 
using the lower categories for the lighter 
weights. 


A served under this condition. His task thus 
consisted of 750 responses per sitting: 250 dur- 
ing the control period and 500 during the experi- 
mental period, 250 of the latter being given in 
response to the anchor. 

The first procedure was instituted to test the 
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necessity for pairing stimuli with anchors, the 
original work by Wever and Zener (8) on the 
Method of Absolute Judgment suggesting that 
anchoring effects might be obtained without 
pairing. 

The first anchor presented to A was 100 gm., 
a weight identical with the greatest stimulus of 
the standard series, following Rogers’ usage. 
The first presented to F and W weighed 104 gm., 
representing the next unit above the greatest of 
the five standard stimuli with which it was 
randomized. 


STATISTICAL TREATMENT OF Data 


Two approaches can be employed in treating 
data obtained in this type of experimentation. 
One may calculate (a) mean values of the judg- 
ments assigned to each stimulus, or other func- 
tions of judgments assigned to each stimulus; or 
one may calculate (b) mean values of the stimuli 
to which each judgment is related, or other 
stimulus functions of the several judgments, 
¢.g., category thresholds. The former approach 
yields functions which recede from the anchors 
as more distant points are presented (declining 
where anchors are greater than standard stim- 
uli). Compared with mounting anchors the 
standard stimuli seem lighter, manifesting con- 
trast (§, p. 311, passim). The latter approach 
yields functions which trend toward the anchors 
(rising where anchors are greater than standard 
stimuli). With increasing anchors, judgments 
at first related to lighter or lesser stimuli are 
accorded to stimuli in the middle or even the 
upper reaches of the standard series. ‘This is a 
direct evidence that O’s scale has undergone 
extension (1, 2,3, 4,5, 7), and an evidence of 
assimilation (5, p. 312, passim). 

Both modes of analysis have been employed 
in previous studié¢s: Hunt and Volkmann (1) and 
Postman and Miller (3) using the former, Volk- 
mann (7) the latter. McGarvey (2) and Rogers 
(4) present their data in both forms, Rogers 
placing principal reliance upon the use of cate- 
gory thresholds. 

While the two approaches stand in logical re- 
lation, it remains to clarify their usage, for the 
techniques are not equally reliable nor equally 
serviceable; the difference appearing, moreover, 
to have a bearing upon the problem of discon- 
tinuity. The computation of judgment values 
related to the several stimuli of the standard 
series yields functions of equal reliability, inas- 
much as each stimulus is presented with a fre- 
quency equal to that of every other stimulus. 
The computation of stimulus values correspond- 
ing to the various judgments, on the other hand, 
is less satisfactory. The several categories are 
not used with equal frequency. In particular, 
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the higher categories are used less and less as 
anchors increase, until they are finally forsaken. 
The present results are analyzed in terms of 
judgments reported for each stimulus in view of 
these considerations. 


REsuLTs 


For clarity of presentation in limited 
space, graphs in Fig. 1 for each O have 
been constructed to show the mean 
values of all judgments assigned to all 
stimuli of the standard series com- 
bined, for each control period and 
each experimental period. Mean 
judgments on anchors are also shown.! 
Results for all three Os reveal that 
judgments upon the standard stimuli 
were progressively altered upon the 
introduction of more and more ex- 
treme anchors, the tendency to re- 
spond in terms of lower categories be- 
coming more pronounced as heavier 
and heavier anchors were submitted. 
Functions based upon them display 
downward trends extending to the 
end of the experiment, terminated 
with the introduction of the heaviest 
anchor. 

It is observed that the obtained 
curves decline in an irregular pattern. 
There are recurrent fluctuations and 
readjustments within the general 
downward trend. There is no evi- 
dence, however, of any well-defined 
‘break’ or ‘breaks.’ Noset or series of 
of points shows so marked a departure 
as to distinguish them unmistakably 
from other shifts occurring in other 
portions of the curves. Variability 
is apparent throughout the data, ap- 
pearing within sessions as well as be- 
tween sessions; in the control series as 
well as in the experimental series; in 


1 For detailed tables of the results order Docu- 
ments 2738 and 2739 from American Documen- 
tation Institute, 1719 N Street, N. W., Washing- 
ton 6, D. C., remitting $1.00 for microfilm (im- 
ages one in. high on standard 35 mm. motion 
picture film) or $1.20 for photocopies (6 X 8 in.) 
readable without optical aid. 


the practice data as well as in the ex- 
perimental data proper. The whole 
picture is that of a set of continuous 
curves showing an overall trend, vary- 
ing from point to point in a random or 
chance pattern. 

The standard stimuli were on the 
average judged lighter in the experi- 
mental series than in the control 
series. Means for the several stand- 
ard stimuli in the experimental series 
typically fell below the respective 
means for the same stimuli in the 
control period of the same session. 
Differences between the functions at 
the established points describe the 
effects of anchoring within sessions. 

Comparison of the graphs will dis- 
close that the effects of anchoring 
tended to be greater under the condi- 
tion of paired stimulation. Curves 
for A show greater and more consistent 
separation than do those for F and W. 

Data collected in the control series 
demonstrate that anchoring effects 
were carried over from day to day: 
that experiences with anchors in 
earlier sessions affected judgments 
in subsequent sessions. Curves based 
upon judgments obtained therein drop 
in a manner analogous to curves based 
upon results taken in the experimental 
series. In F’s words, “You've got me 
so that I expect light ones in the be- 
ginning, even before I start.” 

Under the randomized condition 
the adjudged values of anchors from 
the outset exceeded those for stimuli 
in the standard series. For F the 
mean values of judgments on anchors 
increased irregularly to the 16th ses- 
sion and diminished thereafter, never 
attaining the limiting value of 5.00, 
obtainable only through consistent 
judgments of ‘five.’ The curve for 
W reached 5.00 at the 12th session, 
then receded, ultimately to be estab- 
lished at this level from the 16th 
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session. The function for A is of 
course constant. 

F first indicated an awareness of an 
anchor at 116 gm., saying, “That felt 
like a real ‘five.”” W shouted ““‘six’”” 
upon one presentation of the 112-gm. 
anchor. At 112 gm., A recognized 
that, “The ones I am calling ‘fives’ are 
always heavier than the ones I am to 
judge on a scale from one to five.” 

Even though differences between 
means were reduced in the final ses- 
sions, Os clearly demonstrated ability 
to discriminate, variability of judg- 
ments notwithstanding. Means for 
the several stimuli were almost invari- 
ably in their proper rank orders. 
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There were few reversals. Of 436 pos- 
sible instances in which the value of a 
given mean might lie beyond the mean 
for an adjacent stimulus, only 15 cases 
were observed, counting four identi- 
ties. Of a possible 654 second-order 
reversals, in which the value of a 
given parameter might lie beyond 
those of two adjacent parameters, 
only one case appeared. 


Discussion or Resutts 


The results are in fundamental 
agreement with previous studies (1, 
2,3, 4,7) regarding the changes that 
occur. There is a reduction or ‘lump- 
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ing’ of judgments in the fact of greater 
anchoring stimuli, signifying an ex- 
tension of O's scale. Changes are 
most marked in the case of judgments 
reported upon standard stimuli proxi- 
mal to the anchors, as earlier studies 
have shown. 

The outcome does not agree with 
Rogers’ (4) findings respecting the 
rectilinearity, breaking, and subse- 
quent diversity of the functions. 
Since details of the studies differ in 
several respects, the disparity might 
be accounted for in several ways. 


1. Regression lines have not been em- 
ployed in the graphic treatment of the 
present results. Since their use may 
create both an unwarranted impression 
of uniformity and an exaggerated sense 
of difference, the writer feels that a more 
direct treatment is to be preferred. 

2. Certain errors in Rogers’ graphing 
contribute toward an appearance of dis- 
continuity. Two points for his subject 
C are not plotted in his Fig. 12, p. 33, on 
the ordinate of the 515-gm. anchor, as 
reported in his Table VI, p. 32. Com- 
parisons with the table reveal that the 
regression lines for the 165, 140, and 115 
gm. parameters are not properly related 
to their own respective points, thereby 
underestimating the slopes of these lines 
throughout the region of anchoring 
values 365 to $15 gm., inclusive.” 

3. Rogers relied largely upon data ex- 
pressed in terms of category thresholds. 
As seen above, such analysis is not uni- 
formly reliable. The limitations are 
such as to suggest discontinuity, inas- 
much as the upper thresholds are not 
represented in conjunction with the more 
distant anchors. The simpler treatment 
is free of this disadvantage. Graphs for 
Rogers’ subject C furnish a case in point. 
His Fig. 9 convinces the eye that a break 
occurs by virtue of the fact that higher 
thresholds are absent for greater anchors. 


* By applying the method of least squares to 
data in his Table VI, the interested reader will 
find that lines of best fit thorugh these points 
will prove more nearly continuous with those 
through the 165 to 314 gm. anchor range. 
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Fig. 12 seems less persuasive, even as 
constructed. In the former, only points 
for the lower two parameters might be 
fitted by continuous straight lines. In 
the latter, all six might conceivably be— 
especially if the errors in construction are 
envisaged. The writer feels it possible 
that the alleged discontinuity might have 
gone unnoticed if the more sophisticated 
but less serviceable mode of analysis had 
not been employed.? 

4. Being more protracted, the present 
study affords more information concern- 
ing effects of extreme anchors. As 
against the use of eight anchors, curves 
are based upon points expressing the 
effects of 16 and 17 anchors, permitting 
one to view the shifts and fluctuations in 
relation to larger and more inclusive 
trends. 

s. The anchor range is smaller aéd- 
solutely, but greater relatively, than that 
used by Rogers, being approximately 
four times the range between the great- 
est and the least of the standard stimuli, 
where his anchor range was about twice 
that of his standard series. The anchor 
range reported is 15 or 16 times the inter- 
val to be discriminated, in terms of 
grams, where the ratio of range to inter- 
val in Rogers’ investigation was approx- 
imately 10 to I. 

6. The interval between stimuli is 
about one fourth that utilized by Rogers. 
Barely discriminable, there is practically 
no chance for recognition of the indi- 
vidual stimuli. “The boundaries be- 
tween the weights are vague.” “It is 
only rarely that I feel confident about a 
weight; and that is only in the case of a 
very heavy one” (i.e., the anchor). 

7. The randomization of anchors forms 
a further point of dissimilarity. In it- 
self this innovation probably holds 
limited significance for the difference in 
outcomes. Results demonstrate that 
pairing is not necessary to secure anchor- 


* The reader will find that these statements 
concerning the relative merits of the two treat- 
ments are counter to Rogers’ statement that the 
calculation of mean judgment values “. . . 
do[es] not alter the experimental findings in any 
way, save possibly to make the break in the 
functions more distinct” (4, p. 32). 
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ing effects, though the effect may be less 
marked than under the condition of more 
frequent stimulation. 


It is possible that the discrepancy 
between the present outcome and Mc- 
Garvey’s report that records for a 
number of her Os show “some evidence 
of discontinuity” (2, p. 78) may be 
reconciled in terms of probable vari- 
ability in the functions and the sug- 
gestive nature of one of the statistical 
treatments. 

Results do not confirm Postman 
and Miller’s finding that anchoring 
effects approach a maximum with 
anchors at a limited distance, with 
diminishing returns thereafter. The 
writer recognizes that it is logically 
possible that anchoring functions may 
ultimately reach a point where further 
changes will be of an insignificant 
order. Their data indicate a limit 
with anchors at objective distances of 
approximately one half (400/75oths) 
to two thirds of the range of their 
standard series. The obtained re- 
sults show that changes continue to 
occur with anchors withdrawn to a 
distance four times the range of the 
standard series, suggesting that such 
a limit would only be reached with 
more remote anchors. Whether one 
can meaningfully compare such ratios 
is open to question, since the two con- 
tinua and perceptual processes differ. 
Some importance may attach to the 
fact that the interval between their 
anchors 3 and 4 is not as great, pro- 
portionately, as the intervals between 
the remaining anchors. Coupled with 
uncontrolled variability, the existence 
of which is patent in their graphs, ex- 
pected increments might fail to ap- 
pear. 

Results for F and W provide quan- 
titative evidence that so-called an- 
choring points are not fixed points 
forming an invariant limit for O's 
scale. The former never judged 
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them consistently; the latter did not 
do so until relatively late, when he 
announced that he was “reserving 
judgments of ‘five’ for an extreme 
class.” 

Since anchors are not subjectively 
stable entities, the extension of O's 
scale cannot be expected to ‘follow’ 
the regular objective increments. If 
it were to confrom to the anchor range 
with fidelity, all the standard stimuli 
should theoretically be judged in 
category ‘one’ in the final session, as 
the range of the standard stimuli (84 to 
100 gm.) is then precisely one fifth of 
the range between the least stimulus 
(84 gm.) and the anchor (164 gm.). 
The fact that extension is incomplete 
was first reported by Volkmann (7). 

A salient characteristic of the func- 
tions is their variability. A number 
of factors may be singled out as having 
special significance for the instability. 


1. The interval is close; discrimination 
is dificult; consistency can scarcely be 
expected. The fact that variability in 
the practice data is of the same order of 
magnitude as that in the experimental 
data shows that the fluctuations are not 
a function of anchors alone. 

2. Opposing forces are at work to in- 
fluence judgments. Counteracting the 
effect of the anchor and the effect of 
prior experience with other anchors, one 
can identify another learning effect and 
a set engendered by the instructions. 
Tresselt (6) has shown that experience 
with one scale affects judgments upon a 
subsequent scale, extended practice on 
the first slowing adjustment to the sec- 
ond. Preliminary practice and contin- 
ued contact with control stimuli thus 
theoretically serve in some degree to re- 
sist changes to be wrought by the an- 
chors. An instructionally-derived set in- 
fluences results in the same direction. 
Having been asked to judge on a scale 
from one to five, O attempts to distin- 
guish between the stimuli as long as he is 
able to detect differences, thereby main- 
taining the functions at a level above that 
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which might otherwise be expected. In- 
trospections reveal that Os were moti- 
vated to use all the categories. To con- 
sider implications, instructional set will 
artifactually prevent results from show- 
ing due or proper scalar extension in this 
type of experimentation. Momentary 
changes in the balance between these 
factors may act to produce irregularities. 

3. O's attitude introduces another 
source of variability. Disturbed by the 
intrusion of the anchor, he finds it diffi- 
cult to maintain a consistent approach. 
W asserted, “After being given a series of 
light and medium weights, suddenly a 
heavy one appears. This throws my 
judgments off, almost makes judgment 
impossible. ... 1 feel my judgments 
are being forced . . . by being required 
to judge the weights on a scale from one 
to five.” He expressed a desire to use 
“‘a very much wider scale,” but later re- 
versed himself: “I should find it much 
easier to use a three-point or a four-point 
scale; five is a bit fine.” Irregularities in 
his record beyond the 11th session un- 
doubtedly reflect this conflict. F re- 
ported much the same thing after the 
17th session. “In the beginning [con- 
trol series] you give me light ones-— 
‘ones,’ ‘twos,’ and ‘threes.’ It’s easy to 
judge because you have a small frame of 
reference, just a few units. Toward the 
end [during the experimental series ] you 
give me five units, and I find it more 
difficult to judge; my judgments tend to 
oscillate between high and low.” 

4. The anchoring point furnishes only 
a part of the frame of reference. This 
may be discerned in the foregoing re- 
mark, but it is revealed most lucidly in a 
comment by him at the 16th session. 
“IT am trying to give you my judgments 
of ‘four’ in relation to the ‘three.’ It is 
impossible to judge it in relation to the 
‘five’; there is too much difference. I 
just try to remember what the ‘three’ 
feels like and judge what may be a ‘four’ 
on the basis of that.” 


SUMMARY AND CONCLUSIONS 


To obtain further information con- 
cerning the effect of remote anchoring 
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points upon frames of reference a psy- 
chophysical study was undertaken in- 
volving anchors at objective distances 
up to four times the length or range of 
a standard or dependently-judged 
series. Using the Method of Abso- 
lute Judgment, Os were given practice 
at judging five equidistant stimuli 
ranging from 84 to 100 gm., inclusive, 
on a response scale from one to five, 
light to heavy. Subsequent sessions 
were each divided into control and ex- 
perimental periods. Anchors from 
100 and 104 gm. to 164 gm. were intro- 
duced during the latter in regular as- 
cending order, only one per session, 
being (a) paired with stimuli of the 
standard series under one condition, 
and (b) randomized with stimuli of 
the standard series under a second. 

Mean judgment values were deter- 
mined for each stimulus, this mode of 
analysis being preferred for reasons 
given. Results are in basic agree- 
ment with previous studies (1, 2, 3, 
4, 7) in showing that heavier anchors 
have the effect of reducing the cate- 
gory values of judgments assigned to 
the standard stimuli, signifying an 
extension of O's scale or frame. Col- 
lectively or severally, means for the 
experimental series tended to be less 
than corresponding means for the con- 
trol series. With the introduction of 
more and more remote anchors the 
functions declined progressively, al- 
though irregularly, throughout the 
range studied. 

In this respect the obtained results 
do not agree with Rogers’ (4) con- 
clusions concerning the rectilinearity, 
breaking, and subsequent diversity 
of the functions. The obtained 
curves appear to be continuous, sub- 
ject to chance variation, with no set 
or series of points departing unmis- 
takably from the larger downward 
trend. Results also demonstrate that 
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Pairing is not essential, though anchor- 
ing effects may be greater under the 
condition of more frequent stimula- 
tion. 

It follows that the results are also at 
variance with the report of McGarvey, 
who found “some evidence of discon- 
tinuity” (2, p. 78). 

The outcome does not confirm Post- 
man and Miller’s (3) finding that an- 
choring effects approach a maximum 
with anchors at a limited distance 
from the judged series. 

Despite the absence of anchors, 
functions for the control series tended 
to decline from session to session, 
demonstrating that present judgments 
on a given set of stimuli are influenced 
by prior experience with related 
anchors. 

Quantitative evidence is provided 
demonstrating that so-called anchor- 
ing points do not have point char- 
acter. 

Prior experience with the standard 
series was considered as a factor op- 
posing the influence of the anchors. 

Instructional set was identified as a 
factor which may operate artifactu- 
ally to prevent results from showing 
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proper extension of O’s subjective 
scale or frame. 

Introspective evidences reveal that 
anchoring points furnish but a part 
of the O’s frame of reference. 


(Manuscript received 
September 26, 1949) 
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PROACTIVE INHIBITION AS A FUNCTION 
OF RESPONSE SIMILARITY 


BY ROSS L. MORGAN AND BENTON J. UNDERWOOD 
Northwestern University 


INTRODUCTION 


The present study is concerned with 
the role of response generalization in 
proactive inhibition of verbal learning. 
By response generalization we mean 
that if Response B is associated with 
Stimulus A, other responses similar to 
B have a greater than chance prob- 
ability of being elicited by A. It is 
assumed that the influence of response 
generalization can be inferred from 
performance changes which accom- 
pany manipulation of material along 
any one of several dimensions of simi- 
larity. In the present study such in- 


ferences will be made as a consequence 
of variation in the synonymity of 


adjectives. 


In 1940 Gibson (1) published a com- 
prehensive theory of verbal learning 
based on stimulus generalization, dif- 
ferentiation, and associated phenomena. 
At that time evidence for response gen- 
eralization among verbal materials was 
negative, both Thorndike (8) and Mc- 
Geoch (5) having been unable to demon- 
strate its existence. Therefore, Gibson 
did not include response generalization 
among her explanatory mechanisms. 
Since 1940, enough evidence, both em- 
pirical and logical, has accumulated to 
suggest inclusion of response generaliza- 
tion as a part of any stimulus-response 
theory of verbal learning. 

At the empirical level, three studies 
argue for the importance of response gen- 
eralization as an explanatory concept. 
Haagen (2), in retroactive inhibition ex- 
periments, has shown that response syn- 
onymity produced greater intercondition 
differences than corresponding degrees of 
stimulus synonymity. There was more 
retroactive inhibition in an experiment 
where similar responses were learned to 


the same stimuli than in an experiment 
where the same responses were learned to 
similar stimuli. In both experiments the 
amount of retroactive inhibition de- 
creased with an increase in the degree of 
synonymity. Response generalization 
as the mediating mechanism is a straight- 
forward inference. 

Hughes (4) has given an empirical 
demonstration of response generalization 
in verbal learning. Ss learned a list of 
10 dissimilar adjectives as responses to 
10 dissimilar nonsense syllables. One 
week after learning, the 10 stimuli were 
presented and S was forced to give a re- 
sponse toeach. Judges scaled the errors 
which were made during the learning and 
recall along several dimensions of simi- 
larity to the correct responses. Errors 
which had the highest degree of scaled 
similarity to the correct responses oc- 
curred most frequently, thus providing a 
gradient of response generalization. 

Finally, Osgood (6) has shown that 
interference in learning the second of two 
lists is less if the responses are similar 
than if dissimilar when the stimuli are 
identical in both lists. 

These studies indicate that in account- 
ing for interaction between lists, response 
similarity, hence response generalization, 
must be reckoned with. 

A logical argument can be made for the 
importance of response similarity in 
learning the second of two lists when the 
stimuli of the two lists are identical. 
Maximum negative transfer (associative 
inhibition) occurs when a new response 
term is to be connected to an old stimulus 
term, while maximum positive transfer 
(associative facilitation) is obtained with 
continuous learning—when the same re- 
sponse term is connected with the old 
stimulus term. Since the stimulus rela- 
tionships for these two situations are 
identical, the difference in transfer effects 
must lie in the response relationships. 
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Filling in the points between the ex- 
tremes identified by the above two condi- 
tions should give a continuous change in 
transfer effects. Osgood (7) has recently 


argued for the importance of response 
similarity and one of the present authors 
(9) has given a rough theory of how re- 
sponse generalization would operate to 
produce the expected transfer effects as 
similarity is manipulated. This theory 
will be expanded in the discussion section. 


In the present experiment stimuli 
are identical and response similarity is 
varied from high to low. Inferences 
will be made concerning the influence 
of generalization (1) in learning the 
second list and (2) in the retention of 
the second list after 20 min. 


PROCEDURE 


Terminology.—The terms to be used in identi- 
fying lists and phenomena need specifying. On 
all conditions except the control condition, S 
learns two lists. These two lists will be called 
List 1 and List 2. Transfer phenomena which 
occur during the learning of List 2 will be identi- 
fied as associative inhibition and associative 
facilitation. Transfer phenomena which occur 
during recall and relearning will be identified as 
proactive inhibition and proactive facilitation. 

Basie conditions.—There are six conditions in 
the present experiment. Cond. A is a control 
condition on which a single list is learned and re- 
called. Conds. B through F consist of the learn- 
ing of two lists and recalling Lis: 2 after the rest 
interval of 20 min. Conds. B through F also 
have increasing degrees of similarity between the 
responses in the two lists. ‘Thus, Cond. B is the 
traditional interference paradigm, A-B, A-C, in 
which responses are dissimilar and stimuli are 
identical, while on Cond. F the responses are 
highly similar and the paradigm may be sym- 
bolized as A-B, A-B,. Gradations of similarity 
occur between these two extremes to make up 
Conds. C, D, and E. 

List-1 learning was always carried until S 
anticipated correctly seven of the 12 responses. 
Likewise, List-2 learning was carried to the same 
criterion. After 20 min., S recalled and re- 
learned List 2 to one perfect trial. Proactive 
inhibition will be determined by comparing the 
recall and relearning measures of the experi- 
mental conditions with those of the control con- 
dition (Cond. A). Associative inhibition or 
facilitation will be determined by comparing the 
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learning of the single list on Cond. A with the 
learning of List 2 on the experimental conditions. 
Materials.—The lists of 12 paired two-syllable 
adjectives were constructed by Haagen and used 
in his studies on retroactive inhibition mentioned 
in the introduction to this paper. Since his 
scaling procedure and results have been pre- 
sented elsewhere (3), they need not be repeated 
here. From the scaled words he constructed sets 
of two lists each which were roughly equal on all 
characteristics except response similarity. The 
stimuli in any of the two lists of a set were identi- 
cal, the responses varied in similarity. His ex- 
perimental data, as well as ours, show that no 
significant differences in difficulty obtained be- 
tween the lists of different sets taken individu- 
ally. Therefore, if differences in learning List 2 
of the various sets are obtained, as well as differ- 
ences in its recall, they must be attributed to 
differences in the response similarity from set to 
set. 
The differences in similarity may be illus- 
trated with word pairs taken from the lists. It 
will be remembered that Cond. B has dissimilar 
responses, Cond. C slightly similar, and so on, to 
Cond. F, which has high similarity. 
First List Second List 
Cond. B 
Cond. C 
Cond. D 
Cond. E 
Cond. F 


noiseless-latent 
gorgeous-tripping 
jellied-downcast 
ingrown-enraged 
willing-unclean 


noiseless-sincere 
gorgeous-agile 
jellied-sullen 
ingrown-angry 
willing-dirty 


Counterbalancing.—When the independent 
variable is a characteristic of the task itself, and 
when each S serves in all conditions, it is neces- 
sary to balance the tasks so that no bias due to 
practice and fatigue effects will enter. A form 
of systematic randomization was used which ad- 
hered to the following rules: (1) no two Ss 
learned the conditions in the same order; (2) 
each condition preceded and followed every other 
condition four times, and (3) each condition 
occurred four and only four times at any given 
stage of practice. With six conditions 24 Ss are 
necessary to carry out these principles. 

For each condition, two lists were used. For 
12 Ss one specific list was used as List 1 and the 
other as List 2; for the other 12 Ss the situation 
was reversed. Two lists also were used for Cond. 
A. This allowed additional practice to be given 
after the relearning of the single list, thus keeping 
the practice constant from day to day regardless 
of the condition. S recalled and relearned List 
2 on Conds. B through F. Following this re- 
learning, two recall trials were given on List 1 to 
avoid any possible ‘sets’ developing which would 
lead S to believe that the learning of the first list 
was unimportant. 
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Miscellaneous.—The words were presented on 
a Hull-type drum at the standard rate of two 
sec. for the stimulus followed by two sec. for the 
stimulus and response together. Eight sec. 
elapsed between trials. Three different orders of 
each list were used to avoid serial learning. 

One practice day was given during which S 
went through a complete proactive inhibition 
paradigm. General instructions were given on 
the practice day, these instructions being stand- 
ard except that S was requested to read aloud 
the words as they appeared and to correct aloud 
any errors in anticipation. 

The 20-min. rest interval between learning 
and recall was filled by having S work ‘set’ prob- 
lems similar to those used by Luchins and work- 
ing out various problems on a pyramid puzzle. 
These problems were counterbalanced with con- 
ditions so that if any possible interference oc- 
curred it would be the same for all conditions. 

The 24 Ss, 18 men and six women, were all 
enrolled in introductory psychology and were 
naive to verbal learning experiments. 


RESULTS 


List-1 learning. —Three measures of 
List-1 learning are shown in Table I. 
These are presented primarily to show 


that the lists of each set, when learned 
singly, did not differ appreciably in 
difficulty from those of any other set. 
None of the differences between any 
of the means for any measure is statis- 
tically significant. 

List-2 learning.—An analysis of the 
learning of List 2 to the criterion of 
seven out of 12 correct responses gives 
the necessary evidence to evaluate the 
influence of response generalization on 
transfer. The major data are given 
in Table II. Cond. A, on which no 
prior list was learned, serves as the 
control for evaluating the direction of 
transfer effects. In terms of trials 
to reach the criterion it will be noted 
that under all experimental conditions 
the speed of learning is faster than on 
the control condition. Furthermore, 
it can be seen that the rate of learning 
is directly related to response similar- 
ity. The function is not smooth, 
Conds. C and D being reversed, but, 
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Fic. 1. List-2 learning curves showing associative facilitation as a function of response similarity. 
Cond. A is the control condition; Cond. B had dissimilar responses, Cond. D moderately similar, and 


Cond. F highly similar responses. 


considering the extremes, the relation- 
ship is evident.! For example, the 
difference between Conds. B and E is 
1.37 + .42 trials (¢ = 3.26), which is 
significant far beyond the one percent 
level of confidence. 

The overall influence of response 
similarity can best be seen in the 
learning curves for List 2. We have 


!The results throughout show reversals for 
Conds. Cand D. We have been unable to deter- 
mine the cause for this from our data, and 
Haagen’s results (2) did not show these reversals. 
As a rough check on the scaling, the Ss in the 
present experiment judged the words along the 
similarity scale used by Haagen. The mean 
rank orders of the scaled items were the same as 
found by Haagen although our Ss tended to put 
the items closer together on the scale than was 
true of Haagen’s judges. 


plotted only the curves for Conds. A, 
B, D, and F as samples of the simi- 
larity continuum. At no point do the 
curves overlap. 

Cond. B is the traditional A-B, A-C 
paradigm. It is noted that in terms 
of trials to reach the criterion no as- 
sociative inhibition was evident, nor 
does any evidence present itself for 
such inhibition in the learning curve. 
If associative inhibition is to be found, 
the initial trials of the second list are 
most likely to reveal it. A compari- 
son of the mean correct responses on 
the first anticipation trial of List 1 for 
Cond. B with the mean correct re- 
sponses at the same location of List 2 
does give some evidence of associative 
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inhibition (significant at the five per- 
cent level), but no evidence of such 
inhibition is present on the second 
anticipation trial. Apparently, proc- 
esses producing associative facilitation 
quickly become greater than inhibi- 
tory processes. 

Inter-list intrusions in List 2.—In- 
trusions from List 1 which occur dur- 
ing the learning of List 2 are the most 
pertinent data for inferring competi- 
tion of responses. The total number 
of intrusions for conditions B through 
F is 7, 16, 32, 28, and 27. These in- 
clude only those cases in which a List-1 
response occurred in place of its cor- 
responding List-2 response. If we di- 
vide these totals by the number of 
trials to learn in order to obtain a rate 
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per trial, the following values are ob- 
tained: 1.03, 2.70, 5.06, 5.17, and 5.73, 
for Conds. B, C, D, E, and F, respec- 
tively. Thus, intrusions per trial in- 
crease directly as degree of response 
similarity increases. Wearethus faced 
with the apparent contradiction that 
associative facilitation and competi- 
tion of responses are directly related. 
A resolution of this contradiction will 
be considered in the discussion section. 

Proactive inhibition at recall.— 


Twenty min. after reaching the cri- 
terion of seven out of 12 correct re- 
sponses on List 2, the list was recalled 
and relearned. The mean number of 
correct responses on the first three re- 
learning trials are given in Table III 
and graphed in Fig. 2. 
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Fic, 2. Proactive inhibition as a function of response similarity. 
Conds. B through F represent increasing degrees of response similarity. 


Cond. A is the control condition. 
All points below the dotted 


lines for each of the first three relearning trials represent proactive inhibition. 
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TABLE Ill 


Mean Correct Responses on First 
Turee Revearninc TRIALs 














It should be noted first that the re- 
call scores for all experimental condi- 
tions are less than for the control con- 
dition, Cond. A. All conditions thus 
suggest proactive inhibition. How- 
ever, reliable proactive inhibition is 
produced only by Cond. B. The dif- 
ference between Conds. A and Bon the 
first recall trial, 1.96 + .46, gives a’t of 
4.26. Conds. C, D, E, and F do not 
differ significantly from Cond. A but 
all differ significantly from Cond. B. 
We may thus conclude that the condi- 
tions with similar responses produce 
significantly less proactive inhibition 
than the condition with dissimilar re- 
sponses. The trend for the similar 
conditions is that the greater the simi- 
larity the better the recall. It is ap- 
parent, however, that differences at 
recall as a function of response simi- 
larity are much less than during the 
original learning of List 2. Finally, 
we should note that while associative 
facilitation occurred during learning 
of List 2, no proactive facilitation ap- 


TABLE IV 

Rerearninc Scores 

M oM 
7.42 71 
7.38 59 
7-53 68 
8.12 71 
8.21 .80 
7-71 64 
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peared during the first three relearning 
trials. 

Relearning scores.—The mean num- 
ber of trials required to relearn List 2 
for all conditions is shown in Table IV. 
None of the differences is significant. 
We observe again, then, that the 
facilitation which occurred in learning 
List 2 does not appear in relearning 
the list to one perfect trial. 

Inter-list intrusions during relearn- 
ing.—The total frequency of responses 
from List 1 which occurred during the 
relearning of List 2 is shown in Table 
V. There are no consistent trends in 
the intrusions as a function of response 
similarity. It is noted that the condi- 
tion with highest response similarity 


TABLE V 


Inrer-List Ixrrusions Durinc RELEARNING 





Trial | 


Remainder 





(Cond. F) produced the greatest num- 
ber of intrusions. However, no ap- 
preciable difference exists among the 
other conditions. 


Discussion 


This section will be concerned with 
an elaboration of a theory of response 
generalization to account partially for 
the obtained results. 

Associative facilitation.—The first 
empirical generalization to be dis- 
cussed is that the greater the response 
similarity (with stimuli identical), the 
greater the associative facilitation. 
We will first present the general the- 
oretical approach and then bring in 
additional data which support it, 
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Assume first a gradient of response 
generalization which results from the 
learning of each item in List 1. That 
is, the gradient is built out from Re- 
sponse B as B is reinforced as a re- 
sponse to Stimulus A. This gradient 
implies that other responses, similar 
to B, have a greater than chance prob- 
ability of being elicited by A, this 
probability being greater the more 
similar the response to B. Such a 
situation is depicted in Fig. 3. By 
represents the response most similar to 
B, comparable to List 2 responses of 
Cond. F in the present experiment. 
By, represents low similarity, com- 
parable to Cond. C., while K repre- 
sents a dissimilar response in List 2, 
as was the case in Cond. B. 

On the basis of an assumption of a 
gradient of response generalization, 
we would expect that associative faci- 
litation would be greater the greater 
the similarity of the responses in the 
two lists. Before S starts to learn 
List 2 there is a supra-threshold tend- 
ency for Stimulus A to evoke Re- 
sponses B,, By, etc., this tendency 
being less the less the height of the 
gradient. In effect, because of the 
pre-learning of List-2 responses dur- 
ing the learning of List 1, less new 
learning will be required to reach the 
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Fic. 3. Hypothetical gradient of response 
generalization. See text for complete explana- 
tion. 


ROSS L. MORGAN AND BENTON J. UNDERWOOD 


criterion of seven out of 12 correct re- 
sponses. This pre-learning, occurring 
as a consequence of generalization, 
we shall refer to hereafter as parasitic 
reinforcement. 

Response generalization between 
lists is not the only mechanism operat- 
ing to produce associative facilitation 
in the present conditions. Since the 
stimuli were identical in both lists, it 
would be expected (according to Gib- 
son, 1) that stimulus differentation 
developed in List 1 would transfer to 
List 2. Since the stimuli were identi- 
cal in both lists of all conditions, trans- 
fer of stimulus differentation would 
not account for the varying amounts 
of facilitation observed. 

It is also possible that response dif- 
ferentiation may take place during the 
learning of List 1 and that this too 
would transfer to List 2, resulting in 
facilitation. If so, it would be ex- 


pected that the facilitation would be 
related directly to response similarity. 


That is, if Response B is differentiated 
from other responses in List 1, this 
differentiation should be more bene- 
ficial for learning response B, in List 2 
than in learning Response By. This 
can be tested indirectly. If there are 
differences in response differentiation 
of List 2 for the various conditions, it 
should be evident in the number of 
intra-list errors. Our count of these 
errors shows that there are no appreci- 
able differences in frequency of such 
errors among the various conditions. 
Thus, if transfer of response differenti- 
ation is an important factor in the 
facilitation observed in learning List 2, 
it does not show itself in the errors. 
Since response similarity within the 
lists was low, there is no reason to be- 
lieve that the transfer of response dif- 
ferentiation is an important factor in 
the present conditions. 


Additional evidence of parasitic rein- 
forcement.—-Certain empirical phenom- 
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ena can be deduced from our interpreta- 
tion of response generalization. Tests of 
three such phenomena can be made from 
the data. 

If List-2 responses are being parasiti- 
cally attached to Stimulus A during the 
learning of List 1, we should expect that 
List-2 responses will be anticipated cor- 
rectly sooner when (1) response similar- 
ity is high than when low, and (2) when 
number of reinforcements (correct antic- 
ipations) of an item in List 1 is high than 
when the number is low. Table VI 
shows that this expectation is confirmed. 
We counted the number of times each 
item was given correctly (reinforced) dur- 
ing the learning of List 1, and then deter- 
mined ‘what happened’ to the corre- 
sponding List-2 item on the frst antici- 
cipation trial. We have combined 
conditions in Table VI so that we have 
three categories of similarity (B, C-D, 
E-F) and three categories of List-1 re- 
sponse strength (0, 1-2, 3 or more). It 
can be seen that (1) the greater the sim- 
ilarity of responses the greater the per- 
centage of correct anticipations on the 
first trial of List 2, and (2) the greater 
the number of reinforcements during List 
1 the greater the percentage of correct 
anticipations on the first trial of List 2. 

Another phenomenon which would 
follow from the concept of parasitic rein- 
forcement is that a single correct antici- 
pation on List 2 would appear to result 
in greater response strength the greater 
the response similarity. This would ap- 
pear to be the case only because the 
higher the similarity the greater the 
strength of the response tendencies of the 


second list defore S has had an opportun- 
ity to give the correct response. To test 
this we observed ‘what happened’ to 
items on the trial immediately following 
the trial on which they were first antici- 
pated correctly in learning List 2. If 
the appearance of greater response 
strength per single reinforcement is to be 
demonstrated, the percentage repetition 
of response must be higher the higher the 
similarity of responses in the two lists. 
At the extremes of similarity the results 
are clearly in line with the proposition. 
On Cond. B, with 130 items observed, the 
percentage correct on the succeeding 
trial was 63.8. For Cond. F the per- 
centage repetition was 78.5, this value 
being based on 121 cases. All other 
conditions fell between these extremes 
but no clear gradation as a function of 
similarity was observed. 

A third phenomenon which might be 
expected from the parasitic-reinforce- 
ment interpretation is that the higher the 
response similarity in the two lists the 
more likely it is that corresponding items 
in the two lists will be reinforced. List 1 
was learned to seven out of 12 correct 
responses. It is thus possible that a 
maximum of five items received no rein- 
forcement during the learning of List 1. 
These five items would be relatively 
‘useless’ as far as facilitating the learning 
of List 2 if response generalization is the 
facilitating mechanism. , To derive max- 
imum associative facilitation by parasitic 
reinforcement the items learned in List 2 
should duplicate exactly the items 
learned in List 1. Or conversely, the 
items not reinforced in List 1 should cor- 


TABLE VI 


Parasitic REINFORCEMENT AS A Function oF Response SimILariry 
AND Frequency or Correct ANTICIPATIONS 








Similarity of Responses 





Anticipations B (Low Similarity) 


Number Correct | 
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C and D 


| E and F (High Similarity) 
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respond to the items not reinforced in 
List 2. Obviously, this extreme case 
would not be expected in actual practice. 
We might expect, however, that Cond. F 
would provide more instances in which 
neither of the corresponding items in the 
two lists (A-B, A-B;) were reinforced 
than would Cond. B (A-B, A-K). The 
data confirm this expectation. Of all 
items not reinforced on List 1, 27 per- 
cent of the corresponding items were not 
reinforced on List 2, while the compar- 
able value for Cond. F was 54 percent. 
The percentage values for other condi- 
tions fall in between, being 44 for Cond. 
C, 41 for D, and 44 for C. 

We submit the above three sets of 
auxiliary data as supporting evidence for 
interpreting the associative facilitation 
obtained as being a function of parasitic 
reinforcement produced by response gen- 
eralization. 


Inter-list intrusions during List-2 
learning.-The second major finding 
of the present study was that the 
greater the response similarity be- 
tween the two lists the greater the 
frequency of intrusions from List 1 
which occurred during learning of 
List 2. We thus have three directly 
related measures: (1) intrusions, (2) 
performance (associative facilitation), 
and (3) response similarity. Since 
intrusions are commonly considered 
to be the best index of response com- 
petition resulting in inhibition, we are 
faced with the problem of explaining 
an apparent contradiction. 

In the present conditions, where 
stimuli were identical and response 
varied in similarity, we have said that 
the major facilitating effect which re- 
sults from response generalization is 
parasitic reinforcement. But, we al- 
so know that a basic condition for 
overt competition of responses is that 
two or more responses be attached to 
the same stimulus. Furthermore, it 
should be more difficult to discrimi- 
nate between two similar responses 
than between two dissimilar responses. 
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It must be apparent, therefore, that 
we have a competitive situation exist- 
ing in our high response-similarity 
conditions. The inhibition, however, 
which results from this competition, is 
less powerful than is the facilitation 
produced by parasitic reinforcement. 
The net effect of the two processes is 
associative facilitation. 

On the basis of such an argument it 
may be wondered why Cond. B did 
not show more associative inhibition, 
since there would be little parasitic 
reinforcement involved. Two other 
factors making for facilitation prob- 
ably account for the findings of Cond. 
B. The Ss had only one practice day; 
hence, practice effects should provide 
positive transfer. Stimulus differen- 
tiation transferred from List 1 to List 
2 (as discussed above) should provide 
some facilitating effect. These fac- 


tors would, of course, produce facilita- 
tion in learning List 2 under all condi- 
tions, and while they would produce no 


differences in the conditions as a func- 
tion of response similarity, they con- 
tribute to the direction of the transfer 
effects. In short, we have identified 
several factors which would bring as- 
sociative facilitation, and one factor 
which would produce associative in- 
hibition. It is reasonable that the 
algebraic sum of the effects of these 
factors would give the empirical re- 
sults obtained. 

Proactive inhibition at recall.—The 
results showed that significant pro- 
active inhibition occurred on Cond. 
B. None of the other conditions pro- 
duced significant proactive inhibition, 
but on the other hand, there was no 
facilitation under these conditions as 
was true during learning. Relatively, 
the recall scores showed the same rela- 
tionship with response similarity as 
was found during learning but the 
differences among conditions 
much attenuated, At recall 


were 
there 
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was no consistent relationship be- 
tween intrusions and response simi- 
larity. No appreciable differences in 
frequency or locus of intrusions was 
apparent for Conds. B, C, D, and E. 
Cond. F showed many more intrusions 
than did any of these other conditions. 

An extension of the response-gener- 
alization theory used to account for 
associative facilitation would predict 
that proactive inhibition at recall 
would increase as a function of re- 
sponse similarity. The reasons for 
such a prediction are as follows. 
When an item in List 2 becomes equal 
in strength to its corresponding item 
in List 1, parasitic reinforcement 
should no longer facilitate the List-2 
response. Since both lists were 
learned to seven out of 12 correct, the 
items would be roughly equal in re- 
sponse strength at the time the rest 
interval is introduced. According to 
our reasoning, therefore, parasitic re- 
inforcement should be a minor factor 
at recall of List 2. Interference pro- 
duced by competition of responses, on 
the other hand, should be greatest for 
the high-similarity conditions. This 
should lead to greater forgetting for 
Cond. F than for Cond. B. Since 
quite the opposite results were ob- 
tained, a further evaluation is in order. 

What factors making for proactive 
facilitation may be present at recall in 
amounts directly related to similarity? 
If such factors can be identified, they 
could be said to counteract the com- 
petition factor and produce the ob- 
tained results. ‘Two such factors are 
identifiable. 


1. The assumption that parasitic rein- 
forcement is of little consequence at re- 
call is questionable. It is apparent that 
not all items in List 2 were equal to or 
stronger than corresponding List-1 re- 
sponses. Therefore, some influence of 
parasitic reinforcement should be present 
at recall, and, as will be remembered, its 


RAP ET ORERNP ITT S DPM FATT OEY AERO ey 


601 


facilitating effect should be greater the 
greater the response similarity. 

2. A second factor reflects a method- 
ological difficulty. This difficulty has 
been presented in detail elsewhere (9); 
hence, will be stated briefly here. The 
problem concerns the equality of learning 
of lists carried to the same mastery cri- 
terion at different speeds. It has been 
shown that a list learned rapidly to a 
given criterion is actually stronger than 
a list learned to the same criterion at a 
slow rate. If the two learning curves are 
extrapolated to a hypothetical ‘next 
trial,” it can be seen that the list learned 
most rapidly would have a greater fre- 
quency of correct responses. In the 
present situation the lists with highly 
similar responses were learned most 
rapidly, would therefore be somewhat 
stronger than the low-similarity lists, and 
would give results in the direction found 
at recall. 

Whether the facilitating influence of 
these two factors is great enough to 
counteract the inhibitory influence of 
competition brought on by response gen- 
eralization cannot be determined from 
the present experiment. The recall in- 
trusion data, which did not show consist- 
ent relationships with response similar- 
ity, argue that the situation at recall is a 
complex one.* 


Relearning.—The relearning data, 
which showed no appreciable differ- 
ences among conditions, are fairly 
straightforward. Since all List-2 re- 
sponses should become stronger than 
their corresponding items in List 1 
after a very few relearning trials, the 
influence of parasitic reinforcement of 
items in List 2 would be near zero. 
Once the List-2 responses become 
stronger than List-1 responses, com- 


? Factors which we have considered as com- 
plicating the recall are as follows: (1) degree of 
differentiation at end of List-2 learning as a func- 
tion of similarity; (2) recovery rates of general- 
ization tendencies during rest period as a function 
of similarity; (3) extinction of List-1 responses 
as a function of similarity. These relationships 
(if pertinent) are not known and cannot be 
determined from our data. 
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petition will be minimized and relearn- 
ing should occur at the same rate for 
all conditions. Actually, since recall 
was higher for Cond. F than for Cond. 
B (as illustrations of the extremes), 
Cond. F was relearned at a somewhat 
slower rate. This was probably due 
to the greater competition among the 
similar as compared to dissimilar re- 
sponses. 

One final analysis bearing on para- 
sitic reinforcement will be mentioned. 
During the relearning of List 2, List-1 
responses should be strengthened 
by parasitic reinforcement. The 
strengthening as a result of such rein- 
forcement would, of course, be less 
than that of List-2 responses as a re- 
sult of direct reinforcement, but 
should be measurable nevertheless. 
If such parasitic reinforcement was 
taking place during relearning, the re- 
call of List 1 after relearning of List 2 
should be greater the greater the simi- 
larity of responses. In the present 
experiments S was given two recall 
trials on List 1 after relearning List 2 
so we have an empirical check on the 
proposition. On the recall the mean 
number of correct responses for Conds. 
B through F was 1.7, 3.2, 3.0, 4.3, and 
6.4, respectively. 

Do these data support the idea of 
parasitic reinforcement during relearn- 
ing or do they indicate merely that 
there was less and less interference as 
similarity became greater? Cond. A, 
the control condition, gave a mean re- 
call of 5.6 items after 20 min. On all 
other conditions recall of List 1 was 
given after 30 to 35 min. (depending 
on individual learning rates). Under 
Cond. F the recall of List 1 was 6.4 
items—appreciably higher than on the 
control condition although the reten- 
tion interval was less on the control 
condition. This would argue that the 
gradient of recall of List 1 was not a 
function of less interference alone, but 
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rather, that parasitic reinforcement of 
List 1 took place as a result of response 
generalization during the relearning of 
List 2. 

All things considered, response gen- 
eralization seems to be a very useful 
explanatory concept. For associative 
facilitation as a function of similarity 
it has provided a fairly complete ex- 
planation of the empirical findings; for 
proactive inhibition of recall it has 
been less complete. 


SuMMARY 


The experiment was designed to 
determine the influence of response 
similarity on proactive inhibition and 
associative facilitation. Twenty-four 


Ss learned lists of 12 paired two-sylla- 
ble adjectives by the anticipation 
method. The degree of synonymity 
between response items in the two lists 
for each of five experimental condi- 
tions was varied systematically. On 


all conditions the two lists used had 
identical stimuli. The two lists were 
learned successively to a criterion of 
seven correct anticipations on a single 
trial, and the second list was recalled 
and relearned to a criterion of one 
perfect trial after a 20-min. rest inter- 
val. A sixth condition, on which a 
single list was learned and relearned 
after 20 min., served as the control 
condition. The results showed: 


1. Associative facilitation in learn- 
ing the second list increased as re- 
sponse synonymity increased. In- 
crease in response similarity was also 
accompanied by an increase in num- 
ber of intrusions from the first list 
during the learning of the second. 

2. Proactive inhibition of recall 
tended to decrease as response similar- 
ity increased, although singificant pro- 
active inhibition resulted only when 
responses were dissimilar. Inter-list 
intrusions during recall and relearning 
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showed no systematic relationship 
with response similarity. 


A fairly complete interpretation of 
the results on associative facilitation 
has been given by using the concept of 
parasitic reinforcement brought about 
by response generalization. The in- 
terpretation has been less complete for 
the recall results. 


(Manuscript received 
September 15, 1949) 
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REACTIVE INHIBITION AS A FUNCTION OF 
NUMBER OF RESPONSE EVOCATIONS 


BY PAUL S. SIEGEL 
University of Alabama 


It is a common observation, both for- 
mal and informal, that organisms often 
‘avoid’ the repetition of a reaction. To 
cite a few representative studies, Thorn- 
dike (19) has reported that human sub- 
jects usually fail to repeat a voluntary 
response upon the repetition of the origi- 
nal stimulus. Similar observations have 
been made by Luh (11), by Telford (17) 
and by Telford and Thompson (18). 
Further, animal studies have shown that 
the failure to repeat often occurs when 
the reaction is highly adaptive, i.e., ful- 
fills known needs. The experiments of 
Dashiell (2), of Dennis (3, 4), of Dennis, 
Henneman, and Varner (§), Dennis and 
Sollenberger (6), and of Wingfield and 
Dennis (2c) are illustrative. Pavlov 
(14) and Hovland (9) have both demon- 
strated a paradoxical decrement in the 
strength of a conditioned response follow- 
ing its reinforcement and Hovland has 
termed this the ‘inhibition of reinforce- 
ment.’ 

Recently, Miller (12) and Mowrer (13) 
have proposed a general explanatory 
principle which has been termed the 
Mowrer-Miller hypothesis. This princi- 
ple, which states that the occurrence of a 
reaction may weaken or inhibit the tend- 
ency to make the reaction (somewhat like 
the action of fatigue), has found its most 
rigorous formulation within the system 
of adaptive behavior developed by Hull 
(10). Here the principle is identified as 
the postulate of reactive inhibition (Pos- 
tulate 8) and is expressed as follows: 


Whenever a reaction (R) is evoked in 
an organism there is created as a result 
a primary negative drive (D); (a) this 
has an innate capacity (Jp) to inhibit 
the reaction potentiality (sEx) to that 
response; (b) the amount of net in- 
hibition (7g) generated by a sequence 
of reaction evocations is a simple linear 
increasing function of the number of 


evocations (m); and (c) it is a posi- 
tively accelerated function of the work 
(W) involved in the execution of the 
response; (d) reactive inhibition (JR) 
spontaneously dissipates as a simple 
negative growth function of time (¢’’) 
(10, p. 300). 


This postulate is employed by Hull in 
the derivation of such behaviorally di- 
verse phenomena as disinhibition, spon- 
taneous recovery, reminiscence, the econ- 
omy of distributed practice, and the law 
of less work or effort. Part (c) of the 
principle finds substantiation in the ex- 
perimental studies of Mowrer and Jones 
(13) and Crutchfield (1). Part (d) is in 
part supported by a study performed by 
Heathers (7). This paper proposes an 


examination of Part (b). 


It is assumed that when two well 
established but physically incompati- 
ble reactions (R; and R:) are simul- 
taneously evoked in a series of stim- 
ulus presentations, frequency of oc- 
currence (p) may be regarded as an 
index of the strength of the reaction 
tendency (R,; and R:).'| When this 
is taken in conjunction with Part (b) 
as stated above, it may be reasoned 
that, other conditions constant, a de- 
crease in the frequency of occurrence 
of R, should follow its evocation. 
This comes about through a weaken- 
ing of R, resulting from the action of 
reactive inhibition. Rg: enjoys a mo- 
mentarily greater relative strength 
and will occur with greater frequency. 
Further, the weakening of R,, reflected 
in the increased frequency of occur- 
rence of R2, should be directly pro- 
portional to the number of prior evo- 


! This is an amalgamation of several of Hull’s 
postulates (10): notably, 11, 12, and 16. 
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Fic. 1. Front view of apparatus 


cations of R,. The increase in the 
frequency of occurrence of R; is here 
conceived to be a direct behavioral re- 
flection of the growth of reactive inhi- 
bition. 

Metuop 


Subjects and apparatus.—Volunteers (250) 
were recruited from the undergraduate popula- 
tion of the University of Alabama.?_ None had 
completed advanced work in psychology, the sex 
distribution was 88 percent male and 12 percent 
female, ages ranged from 17 to 40 yr., and all Ss 
were predominantly right-handed. 

The apparatus (see Fig. 1) consisted of two 
ordinary light switches selected for similarity of 
appearance, and spring tension, mounted on a 
table 30 in. apart with a five-watt bulb stationed 
midway between and slightly beyond them. 
The switches were wired in series with the bulb 
and a remote key operated by E. The S was 
seated at a desk-arm type chair within conven- 
ient reach of both switches. 

Preliminary preparations consisted of having 
the seated S laterally adjust the chair until satis- 
fied that the two switches were equally easy to 
reach with the right hand. The general instruc- 
tions were then read by the E: 


Before you is a light bulb and two switches. 
When the light goes on, please turn it off, us- 
ing either switch. They are identical and both 
will turn off the light. It does not matter 
which one you use. After turning off the 


2Mr. Morris Steinberg of this department 
assisted in the conduct of the experiment. 
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light, return the switch to its normal position 
and bring your hand back to its original posi- 
tion. I can answer no questions about the 
nature of the experiment. Are the instruc- 
tions clear? 


Each S was then practiced four times in the 
sequence, RLLR. The following instructions 
were then read by E: 


This experiment is divided into two parts. 
During the first part, please use only the right 
switch. During the second part, you are free 
to use either switch. I shall tell you when the 
second part begins. Remember, during the 
second part you are free to use either switch. 


Ss were then given varying amounts of exer- 
cise with the right switch. At this stage, five 
groups (N = so) must be distinguished accord- 
ing to the number of right responses given? 
Group o—o right reactions; Group 20—20 right 
reactions; Group 40—40 right reactions; Group 
80—8o right reactions; Group 160—160 right 
reactions. 

During this series, the S set his own tempo. 
As soon as the reaction had been made and the S 
had returned his hand to the resting position, E 
presented the stimulus again. ‘The average rate 
of presentation was approximately one each two 
sec. At the end of the series, a two-sec. break 
was interpolated during which the further in- 
structions were read: 


You may now choose either switch. 


A free-choice series of 10 stimulus presenta- 
tions was immediately given at the same tempo. 
Following this the S was asked to account for his 
choices during the last series and dismissed. 

In the instance of Group 0, the absence of 
exercise with the right switch necessitated a 
slightly altered procedure. To the general in- 
structions was added: 


Remember, each time the light goes on, turn 
if off using whichever switch you prefer. Re- 
turn the switch to its original position and 
bring your hand back to rest. 


Practice was then given in the RLLR se- 
quence, the instructions, “You may now choose 
either switch,” were read, and the free-choice 
series immediately initiated. 


Resutts anv Discussion 


The number of left responses yielded 
by the subjects of each group during 


* The same sex distribution was observed in 
each group. However, an earlier and similar 
study (15) suggests no genuine sex difference in 
this situation. 
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TABLE I 


Frequency Distaisutions or Lerr-Cuoice Responses Durinc Free—Cuoice Series or 
10 Resronses As a Function or NuMBER oF 


No. Prior 
Right Ke 


sponses 


the free-choice series is found in Table 
I. The table is easily read. For ex- 
ample, the first entry (top, left) indi- 
cates that 17 subjects of Group 0 gave 
no left reactions during the free-choice 
series of 10 stimulus presentations. 
The other entries may be similarly in- 
terpreted. The mean for each group 
has been entered in the table as well as 
the simple proportion of left responses 
(frequency of occurrence). 

A consideration of the means (or the 
proportions) of Table I indicates a de- 
cided trend. As prior exercise in the 
right reaction increases, the frequency 
of occurrence of the left response in- 
creases; presumably, through the 
weakening of the right reaction 
through reactive inhibition. Apply- 
ing the chi-square test of independence 
to the data, a value of 67.8 is obtained. 
For 24 degrees of freedom, this is sig- 
nificant at a level well beyond the 
one percent fiducial limit. We may 
safely conclude that it is highly im- 
probable that the number of left re- 
sponses is independent of number of 
prior right reactions, i.e., statistically 
significant differences exist among the 
groups. 

Frequency of occurrence (p) has 


*Sixteen degrees of freedom were lost in the 
combination of frequencies for categories con- 
taining less than five cases. 


Prior Ricut Responses 


been plotted against number of prior 
reactions in Fig. 2. Inspection of the 
empirical plot strongly suggests recti- 
linear regression.» Employing the 
method of least squares, the straight 
line function, p = .002 n + .36, has 
been fitted tothe data. The fit is seen 
to bea reasonable one. It is of course, 
impossible to prove that this relation- 
ship is a straight line. Wecan merely 
show that the obtained deviation from 
rectilinearity is easily attributed to 
random or chance fluctuation. The 
test for rectilinear regression described 
by Snedecor (16, p. 382) has been ap- 
plied to the data and is summarized 
in Table II. The obtained F-value of 
.44 indicates that deviations of the 
magnitude here observed may be 
readily attributed to chance fluctu- 
ation about a true straight line. Re- 
garding the increase in p as a direct 


5 Like the means, the medians also seem to fall 
along a straight line. Means were elected here 
because of the ease of calculating the statistical 
significance of the deviation from linearity. 

*It is unfortunately true that this test as- 
sumes the normality of the underlying parent 
population. Consideration of Table I reveals 
that several of the distributions are quite skewed. 
There seems to be no normalizing transformation 
that would not overcorrect some of the distribu- 
tions. As derived from these data, this statistic 
contains an undetermined degree of error. It is 
hoped that the use of a large number of cases has 
minimized this error. 





REACTIVE INHIBITION 


reflection of the growth of reactive in- 
hibition, Hull’s principle is clearly 
verified. In view of the fact that a 
a voluntary reaction of the human sub- 
ject was measured, the results are re- 
markably uniform and the agreement 
quite striking. Perhaps this uniform- 
ity will lend substance to the most re- 
cently voiced plea by Hilgard (8) for 
a more naturalistic approach in psy- 
chological experimentation. 

It is interesting to inquire into the 
locus of reactive inhibition. The two 
gross possibilities that present them- 
selves are peripheral and central. 
Mowrer and Miller have plainly 
adopted a peripheral view. Reactive 
inhibition is conceived to be a state of 
muscular fatigue or strain, or both. 
Hull’s position is less clear, perhaps 
for the reason that the problem does 
not arise in a strictly molar system. 
It is to be noted that his formal postu- 
late contains no assumption as to 
locus. His sub-molar considerations 
seem to imply acceptance of the 
Mowrer-Miller hypothesis or ultimate 
peripheral localization. The terminal 
reports of our subjects are interesting 
in this context. Although illuminat- 
ing, no definitive answer emerged from 
an analysis of these introspective re- 
ports. Quite marked in the instance 
of the 80 and the 160 groups, was the 
frequent explanation of predominant 





Frequency Of Lett Reaction 











09 20 40. ee 
Wo Prior Right Reactions 
Fic. 2. The relationship between number of 


left choices made during the free-choice series and 
number of prior right reactions 





TABLE II 


Test ror Linear Recression 








Sum of Mean 


Source of Variation Squares | Square 





No. Prior Right Re- 
actions 

Linear Regression 

Deviations from Lin- 
ear Regression 











left choice as resulting from muscular 
fatigue or strain.’ Typical reports 
were: 


“*Cause my hand was tired.” 

“My arm was tired from going to the right 
one.” 
“Hand tired from going right.” 


Clearly, a peripheral locus is indi- 
cated. However, equally frequent 
were such reports as: 


“To break monotony.” 
“To vary the routine.” 
“Moved to left for variety.” 


A more central involvement seems in- 
dicated. In the last analysis, the in- 
hibition here observed probably has 
both peripheral and central aspects. 


SUMMARY 


Hull’s principle of reactive inhibi- 
tion, which states that a reaction is 


7 Extrapolation of the curve indicates that the 
frequency of occurrence of the left reaction would 
become 1.0¢ foilowing 320 prior right reactions. 
Beyond this point, frequency of occurrence will 
not reflect the further accumulation of reactive 
inhibition. Some other behavioral index must 
be employed. Although complaints were rare 
in our 160 group, this number of consecutive 
right responses approaches the endurance limit. 
The experimenter interested in an empirical 
check of the above prediction will be well advised 
to provide himself with means for countering 
physical aggression. 
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weakened in direct proportion to the 
number of times it is evoked, was ex- 
amined in an experimental situation 
affording the human subject the op- 
portunity for making either of two 
alternative responses. It was found 
that the frequency of occurrence of 
one response increases with the 
amount of prior exercise in the execu- 
tion of the other. Agreeing with 
Hull’s principle, this increase was 
found to be rectilinear. 


(Manuscript received August 26, 1949) 
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THE INFLUENCE OF THE INTER-TRIAL INTERVAL ON THE 
HUMPHREYS’ ‘RANDOM REINFORCEMENT’ EFFECT 
; DURING THE EXTINCTION OF A 
VERBAL RESPONSE! 


BY DAVID A. GRANT, JOHN P. HORNSETH, AND HAROLD W. HAKE 


University of Wisconsin 


The object of the present experi- 
ment was to investigate the influence 
of the inter-trial interval on the 
Humphreys’ ‘random reinforcement 
effect’ during the extinction of a ver- 
bal response. 

Humphreys, Miller, and Ellson (6) 
observed that variation of the inter- 
trial interval from 5 to 20 sec. had 
little effect upon the acquisition, ex- 
tinction, or spontaneous recovery of 
verbal expectations. These investi- 
gators remarked that more informa- 
tion was needed on expectancy and 
decremental factors in conditioning 
before definite conclusions could be 


drawn as to the role of expectancy (or 
similar processes) in conditioning. 
The growing literature on partial 
or intermittent reinforcement (7, pp. 
$22 ff., 4, pp. 112 ff., 3, 2,9) sustains 
interest in their remarks, and the pres- 
ent study extends their findings. 


APPARATUS AND PROCEDURE 


The apparatus was similar to that of Hum- 
phreys (5). Two frosted 60-watt bulbs were 
mounted on the left and right sides of a three-foot 
square, vertical, black board. The lights were 
controlled by a concealed electronic timer. The 
left-hand light served as the ‘conditioned stim- 
ulus’ (CS), and the right-hand light served as the 
‘unconditioned stimulus’ (UCS). A given trial 
was considered a ‘reinforced’ or positive trial if the 
right-hand light flashed on at the termination of 
the left-hand light. The interval between on- 
set of the left-hand flash and the occurrence of 
the right-hand flash was three sec. Negative or 


1 This research was supported in part by a 
grant from the Research Committee of the 
Graduate School of the University of Wisconsin 
from funds provided by the State Legislature for 
1948-49. 


non-reinforced trials also were given in which only 
the left-hand light was flashed. 

The Ss were 184 men and women students 
who volunteered from elementary psychology 
classes. They were all read standard instruc- 
tions directing them to guess on each trial 
whether the right-hand light would flash on. 
They were instructed to record their responses on 
IBM answer sheets immediately after the left- 
hand light came on. The Es checked carefully to 
see that the Ss followed these instructions. Ex- 
periments were carried on with groups of Ss in 
small lecture rooms. 

A tri-factorial experimental design was used 
as outlined in Table I. The three factors were: 
(a) spaced os. massed acquisition trials; (b) 
spaced os. massed extinction trials; and (c) 
50 percent random es. 100 percent positive trials 
or ‘reinforcement.’ The interval between 
massed trials was about five sec., and the interval 
between spaced trials was about 40 sec. In the 
50 percent ‘random reinforcement’ procedure 
half of the acquisition trials were positive and 
half negative. The positive trials were sched- 
uled randomly except that no more than two 
consecutive positive or negative trials were per- 
mitted. Twenty-three Ss were used with each 
of the eight procedures called for by the design. 


RESULTs 


The S could make two types of re- 
sponse. If he guessed that the second 
light would flash on, it was called a 
positive response. If he guessed that 
the second light would not flash on, it 
was called a negative response. The 
essential findings of the experiment 
are readily apparent in Fig.1. In this 
figure are plotted the acquisition and 
extinction of the positive responses 
for the eight groups of Ss. The data 
are averaged in percent frequency in 
successive five-trial blocks. The 
figure shows clearly thatall four groups 
receiving 100 percent positive trials 
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TABLE I 


Desicn or Tue Exprermment 


Proce- 
dure 


Extinction 


Acquisition 50 Trials a 
20 Trials 


Massed 
Spaced 
Massed 
Spaced 
Massed 
Spaced 
Massed 
Spaced 


A | 100% Massed | 
B 100% Massed 
& 100°,'Spaced 
D 100% "Spaced | 
FE | 50%"Random Massed! 
F } §0%‘*Random Massed} 
G | + 50%*Random Spaced | 
H | 50%!Random Spaced | 


during acquisition show a rapid rise in 
their frequency of positive responses, 
and all four groups receiving 50 per- 
cent positive trials continue to emit 
about 50 percent positive responses 
throughout acquisition. During ‘ex- 
tinction’ the frequency of positive re- 
sponses drops rapidly in all four 100 
percent ‘reinforcement’ groups, but 


the drop is much more gradual in the 
four 50 percent ‘reinforcement’ groups. 


The only factor whick clearly affects 
the responses is the partial ‘reinforce- 
ment.’ Inter-trial interval has no 
consistent effect upon acquisition, ex- 
tinction, or upon the partial ‘rein- 
forcement’ phenomenon. 


RESPONSES 





The differences between groups were 
tested by means of analysis of vari- 
ance. Two acquisition scores were 
obtained for each S by totalling sepa- 
rately the positive responses during 
the first 25 trials and second 25 trials 
of acquisition. An extinction 
was obtained for each S by totalling 
the number of positive responses dur- 
ing the extinction trials. Although 
these scores were not as continuous as 
might be desired, the results of the 
analyses were sufficiently striking to 
overcome this difficulty. The analy- 
ses aresummarizedin Table ll. (The 
acquisition analyses involve only two 
factors since the spacing or massing of 
extinction is irrelevant.) With an F 
of 70.39 during trials 1-25 and 500.46 
during trials 26-50, the partial ‘rein- 
forcement’ effect is highly significant. 
Inter-trial interval produces no sig- 
nificant effect upon acquisition, in 
general, or upon the partial ‘reinforce- 
ment’ phenomenon as shown by the 
insignificant Fs for interactions. 

The analysis of the extinction scores 
is equally clear. Partial ‘reinforce- 
ment’ affects extinction, as is shown 
by an F of 158.92, but inter-trial 


score 
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Acquisition and extinction curves of the Humphreys’ verbal response. The 


EXTINCTION TRIALS 
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percentage of 


plotted for successive five-trial blocks for each of the eight experimental groups. 





‘RANDOM REINFORCEMENT’ EFFECT 


TABLE Il 


Awatyses oF VARIANCE oF THE Acquisition anv Extinction Scores 


Source of Variation 





Partial os. 100% Reinforcement 
Acquisition Massing 

Extinction Massing 

Reinforcement K Acquisition 
Reinforcement X Extinction 

Acquisition X Extinction 

Reinforcement X Acquisition X Extinction 
Replication (Error estimate) 


* Significant at the 0.1% confidence level. 
** Error variance estimates. 


Acquisition 
Trials 1-25 


Acquisition 


Extinction 
Trials 26-50 


500.46* 
1/33-31 


1.16 


1.41 
(9.84)** (10.42)** 


(8.66)** 


All other entries in the table are Fs obtained by dividing the mean 


squares (in parentheses) into the mean squares for the sources of variation listed in the left-hand 


columns. 


interval has no apparent effect upon 
extinction regardless of the spaced, 
massed, partial or 100 percent char- 
acter of the previous acquisition trials. 


Discussion 


Inter-trial interval had little or no 
effect upon acquisition or extinction of 
the verbal expectations of the present 
experiment. This corroborates and 
extends the results of previous investi- 
gators (6). The present experiment 
like that of Sheffield (9) was designed, 
incidentally, to separate the effects of 
spacing acquisition trials from the 
effects of spacing extinction trials, so 
that the present conclusions have a 
broader basis than did those of the 
earlier paper (6). The absence of 
significant interactions showed further 
that the inter-trial interval had rela- 
tively little effect upon these differ- 
ences in acquisition and extinction 
which are introduced by partial ‘rein- 
forement.’ These results differ 
markedly from those of Sheffield (9) 
who used the running response of rats 
in a single alley. 

Humphreys, Miller, and Ellison rea- 
soned that phenomena determined by 
expectancy in different experiments 
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would show similarities. Further, 
they argued that the work decrement 
factor was of considerable import in 
bringing about differences between 
spaced and massed conditioning trials 
such as those reported by Calvin (1) 
and Reynolds (8). In view of the 
absence of decremental effect on the 
partial reinforcement phenomenon in 
the present study, this line of reason- 
ing leads to the prediction that mass- 
ing or spacing acquisition and extinc- 
tion of a CR such as the conditioned 
eyelid response would have little effect 
on the verbal expectancy aspect of the 
eyelid responses under partial rein- 
forcement. In the event that inter- 
trial interval is found to influence the 
relative amount of the Humphreys’ 
effect in eyelid conditioning, one of 
three interpretations would be indi- 
cated: (a) the ‘decremental’ effect and 
the ‘expectancy’ effect are mutually 
dependent; (b) these effects are non- 
linear; (c) the ‘decremental’ and ‘ex- 
pectancy’ effects are non-linear and 
mutually dependent. It seems clear 
that the influence of inter-trial inter- 
val phenomena will be found to vary 
from one action system to another and 
perhaps from species to species (9) so 
that general theoretical interpreta- 
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tions of these phenomena are pre- 
mature. 


SUMMARY 


Verbal expectations of 184 college 
students were studied using Hum- 
phreys’ technique in a tri-factorial 
experimental design in which (1) 
inter-trial interval during acquisition, 
(2) inter-trial interval during extinc- 
tion, and (3) 100 percent os. 50 per- 
cent ‘partial reinforcement’ were con- 
trasted. Fifty acquisition trials and 
20 extinction trials were used. The 
principal results were as follows: 


1. The work of Humphreys, Miller, 
and Ellson was corroborated and ex- 
tended. Inter-trial interval had no 
effect upon acquisition or extinction 
of the verbal expectations. 

2. A clear-cut partial ‘reinforce- 
ment’ or Humphreys’ effect was ob- 
tained. 

3. Inter-trial interval had no effect 
upon the Humphreys’ partial ‘rein- 
forcement’ phenomenon. 


The results suggest an interpreta- 
tion of the Humphreys’ phenomenon 
largely in terms of non-decremental or 
verbal factors as opposed to decre- 


mental factors. It is suggested that 
if inter-trial intervals affect the rela- 
tive degree of the partial reinforce- 
ment phenomenon with the eyelid CR, 
or other action systems which are sub- 
ject to decremental factors, the dec- 
remental and non-decremental fac- 
tors interact or are non-linear. That 
a general theoretical treatment of 
partial reinforcement phenomena is 
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premature is illustrated by the fact 
that the present findings differ 
markedly from those of Sheffield on a 
different response in a_ different 
species. 
(Manuscript received 
September 24, 1949) 
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THE RELATIONSHIP BETWEEN LENGTH OF INTERVAL 
SEPARATING TWO LEARNING TASKS AND 
PERFORMANCE ON THE SECOND TASK! 


BY CHARLES E. HAMILTON? 
From the Psychological Laboratory, State University of lowa 


The background of experimental work 
and thought regarding the variable of 
warm-up effect in human learning has 
been presented by Thune (5) in a preced- 
ing, collateral study and need not be re- 
viewed in detail here. In the verbal 
learning situation, by analogy with the 
motor learning situation as analyzed by 
Ammons (1), it may be presumed that a 
state or condition of warm-up is acquired 
by subjects during practice on the learn- 
ing task. When rest follows, the ac- 
quired warm-up state may be partially or 
completely lost during this time and 
must be regained during the course of 
subsequent practice on the same or 
similar tasks. The state of warm-up it- 
self is thought to consist of the appropri- 
ate attitudinal and postural adjustments 
adopted by subjects to accomplish the 
task. Such adjustments are acquired 
over a number of trials rather than in- 
stantly and are lost gradually, rather 
than suddenly during the rest period 
following. Thus, after practice on one 
task, subsequent performance on the 
same or a similar task should benefit as a 
direct function of the amount of practice 
given the first task immediately prior to 
the second, and inversely as the amount 
of time separating the two tasks is in- 
creased. These views indicate two sepa- 
rate but closely related experimental 
problems demonstrating their validity. 
The first, that of showing an increased 
level of performance on a second verbal 
learning task following different amounts 
of immediately prior practice on the first 
(rest interval constant) has been pre- 


1 Based on a dissertation submitted to the 
faculty of the Department of Psychology of the 
State University of Iowa in partial fulfillment of 
the requirements for the Ph.D. degree. The 
author is greatly indebted to Professor Kenneth 
W. Spence who directed the investigation. 

2 Now at Montana State University 


sented by Thune in a preceding study. 
The second, that of showing a decreased 
level of performance on a second verbal 
learning task following different amounts 
of rest separating this task from the first 
forms the subject for the present study. 
Irion (3,4) studying retention as a 
function of the length of rest interval 
presents evidence which indicates that 
part, at least, of the loss in retention ob- 
served with increased lengths of rest may 
be attributed to loss of warm-up adjust- 
ments rather than loss in the form of 
‘forgetting.’ He tested recall of subjects 
for a list of paired-adjectives learned for 
ten trials and relearned after intervals of 
o (massed), 1, §, 10, 20, or 40 min., or 24 
hours. He found that recall, as meas- 
ured by the first relearning trial, de- 
creased as the length of the rest period 
increased. To test whether such loss, 
seen in similar situations since the time 
of Ebbinghaus, could be attributed to 
loss of warm-up ‘adjustments,’ he intro- 
duced a condition intended to reinstate 
some degree of the postulated warm-up 
without formal learning taking place. 
This condition was intended for com- 
parison with the other conditions to 
determine whether as great loss in re- 
tention would be observed. This was 
accomplished by having one group of 
subjects duplicate the 24-hour condition 
with the exception that immediately 
prior to the first relearning trial (after 
24 hours rest) they were given one trial 
of naming colors exposed at the window 
of the memory drum in the same manner 
and rate as the words themselves, then 
immediately were given the relearning 
trials. Recall scores under this condition 
were superior to similar scores for each 
condition of rest from five min. to 24 
hours and very nearly equal to the one- 
min. rest condition. From this Irion 
concluded that the observed loss in re- 
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tention following rest under the various 
conditions was due at least in part to the 
loss of warm-up adjustments acquired 
during learning and the beneficial effects 
of the color-naming trial resulted from 
the reinstatement of some of these ad- 
justments. 

Irion’s results suggest that it is possi- 
ble to demonstrate a parallel sequence of 
events in the situation in which new ver- 
bal material is learned following the 
learning of a first, similar material, with 
a variable period of rest introduced be- 
tween the two learning tasks. Data 
relevant to this problem may be found in 
the study by Thune already cited. 
Among his experimental conditions is 
one which involves an interval of 24 
hours between practice on an earlier, 
warm-up list and the learning of the 
second test list. A comparison of the 
rate of learning of the second list under 
this condition with that under the condi- 
tion in which an equal number of trials 
were given on the warm-up list only to 
minutes prior to the test list, showed that 
the rate of learning following the longer 
lapse of time was significantly inferior to 
that following the shorter lapse of time. 
Since under these conditions new ma- 
terial was being learned following rest 
(rather than previously learned material 
being recalled), the evidence for the 
operation of warm-up effect (the loss of 
such adjustments) is relatively uncompli- 
cated. The demonstration by Thune of 
the adverse effect upon learning of a 
24-hour rest as compared with a 10-min. 
rest indicates limits within which the 
more precise nature of this functional 
relationship may be investigated. 


Purpose or EXPERIMENT 


The present experiment was de- 
signed to investigate the nature of 
the relationship between level of per- 
formance on a verbal learning task 
following previous practice on a simi- 
lar task and different lengths of inter- 
polated rest interval of less than 24 
hours. To this end, nine groups of Ss 
received 10 trials on a list of paired ad- 
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jectives (to be referred to as the warm- 
up list) followed by a rest interval of 
8 sec., 5, 10, 20, 40, 60, 120, 180, or 
240 min., and in turn followed by 10 
trials on a list of paired nouns which 
served as the test list. Results ob- 
tained in this fashion should provide a 
relatively direct measure of the extent 
to which warm-up adjustments ac- 
quired during practice on the first list 
are lost during the period of rest fol- 
lowing, as shown by level of perform- 
ance on the test list. Knowing from 
Thune’s data that such a loss is sig- 
nificant when a period of 24 hours 
intervenes, these data should provide 
information as to whether such loss is 
uniform and gradual over this period 
or possesses some other functional 
characteristic. 


EXPERIMENTAL TECHNIQUE 


Since the present study was conducted in 
close coordination with the preceding study by 
Thune, effort was made to duplicate insofar as 
possible experimental features of the two investi- 
gations. To this end, the learning materials 
employed, total number of learning trials on 
lists, and general procedures for the two studies 
are the same; specific differences lie in varying 
either the amount of immediately prior practice 
on the warm-up list, rest interval constant 
(Thune’s study), or holding practice constant 
and varying the rest interval (the present in- 
vestigation). For these reasons data so ob- 
tained are directly comparable from one study 
to the other. 

Apparatus and materials.—A Hull-type mem- 
ory drum was employed to present the paired- 
associate leagning material. This material was 
exposed at approximately a four-sec. rate includ- 
ing the two-sec. anticipation interval. An eight- 
sec. interval fas employed between successive 
presentations of a list. The memory drum was 
mounted in a screen so that E could be seated 
behind it and not be seen by the S. The ex- 
posure window of the drum was lighted by a 
shaded 7}-watt electric lamp. Lists of 15-unit 
paired associate one-syllable nouns and two- 
syllable adjectives were employed as learning 
materials. These were presented to Ss in three 
different orders to avoid serial position effects. 
The words for the lists were selected according 
to conventional rules for list construction; in 
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addition, an attempt was made to use pairs as 
nearly alike in associative value as possible. 
Materials employed during the rest interval 
introduced under certain of the conditions con- 
sisted of cartoons clipped from various periodi- 
cals and mounted on 5 X 7 cards. 

Subjects.—A total of 225 Ss served in the ex- 
periment; each S served under a single experi- 
mental condition only, so that each of the nine 
experimental groups consisted of 25 Ss. All Ss 
were obtained from undergraduate courses in 
psychology; none had served previously in a 
memory drum experiment and all were naive 
with respect to the purpose of the experiment. 
Ss were discarded from the experiment if they 
failed to meet certain criteria for minimal and 
maximal performance: (1) failure to make a 
correct response during either preliminary prac- 
tice trials or during the ten trials on the warm-up 
list, (2) in the case of the last five Ss in each 
group, failure to make at least 10 correct re- 
sponses on the warm-up list, or (3) accomplish- 
ment of a perfect score on one of the first five 
trials of the warm-up list. The total number of 
Ss so discarded was 31. Ss were assigned to the 
conditions in a partially random fashion; random 
assignment was made within the conditions em- 
ploying rest intervals of (1) from 8 sec. through 
10 min., (2) 20 and 40 min., and (3) 60 min. 
through 240 min. 

Procedure.—Two days of testing were em- 
ployed. On the first day general instructions 
were given and Ss learned a practice list of paired 
one-syllable nouns for six trials. This served to 
acquaint the Ss with the experimental situation 
and reduce variability due to differences in un- 
derstanding instructions, etc. The experiment 
proper was conducted on the following day. At 
this time Ss in conditions employing rest inter- 
vals up to and including 40 min. had one experi- 
mental session in which they first learned a list 
of two-syllable adjectives (the warm-up list) for 
10 trials. This was followed by a period of rest 
according to the condition assigned; at the con- 
clusion of the rest interval the test list con- 
sisting of a new group of one-syllable paired 
nouns was learned for 10 trials. During the 
rest interval Ss were required to sort cartoons 
described previously. Ss serving under condi- 
tions employing rest intervals over 40 min. were 
dismissed from the experimental room during the 
rest period. No restriction was placed on their 
activity during this period aside from a caution 
to not discuss the experiment with other indi- 
viduals. All questions were deferred until the 
end of the experiment. Scores were recorded in 
terms of the number of correct anticipations 
(i.ec., the number of correct response words 
spoken before the shutter raised) on each trial. 
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RESULTS 


The mean numbers of correct antici- 
pations for the six trials on the practice 
list, 10 trials on the warm-up list 
(list A), and 10 trials on the test list 
(list B), are shown in Table I, columns 
1, 2, and 3, respectively, for the sepa- 
rate experimental groups. Examina-_ 
tion of columns 1 and 2 of this table 
reveals no substantial differences in 
this index of learning among the dif- 
ferent groups of subjects; simple 
analysis of variance applied to the 
means of these groups for each of the 
two measures involved yields F-ratios 
less than one, indicating that the small 
observed differences cannot be con- 
sidered significant. There is no evi- 


dence that up to the point of the intro- 


TABLE I 


Mean NumBer or Correct ANnTICIPATIONS 
Durinc Learnine or Practice List, 
Warm-up List (List A), ann 
Test List (List B) 





Condition 
(Interval Be- 
tween Lists 


A and B) 


List A 
(10 trials) 


Practice 
(6 trials) 


List B 
(10 trials) 


8 Sec. 16.76 34.53 

-37 + 4.05 
5 Min. 32.92 
+ 4.49 
10 Min. 35.04 
+ 3.9! 


34-96 
+ 3.92 


20 Min. 


40 Min. 


60 Min. 


120 Min. 


180 Min. 


240 Min. 
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duction of different lengths of rest 
interval (between the warm-up list 
and test list) for the various conditions 
that the amounts of learning exhibited 
by the experimental groups were in 
any way different. Column 3 of this 
table presents the mean number of 
correct anticipations per condition for 
the total of 10 trials on the test list 
(list B). It may be seen from this 
column that the mean number of ccr- 
rect anticipations tends to decrease as 
the length of the preceding rest inter- 
val increases. The maximum differ- 
ence observed is approximately 25 re- 
sponses and is far greater than any 
difference observed among the groups 
on list A prior to any differential 
treatment. Simple analysis of vari- 
ance performed on these means yields 
an F-ratio of 4.01. This value is 


significant at the one percent level of 
confidence, an F-ratio of approxi- 


MEAN CORRECT ANTICIPATIONS 
aa oO 
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mately 2.60 being required at this 
level with 8 and 216 degrees of free- 
dom. 

Examination of the data in column 
3 of Table I reveals that the successive 
decrements in learning accompanying 
increasing length of rest interval tend 
to become smaller as the length of the 
rest interval is increased, and that 
rests of more than one*hour appear to 
have little additional effect on reduc- 
ing the amount of learning exhibited 
on list B. 

The relationship is indicated graphi- 
cally in Fig. 1, which shows the mean 
number of correct anticipations on list 
B plotted against length of rest inter- 
val since previous learning. In this 
figure the ordinate represents correct 
anticipations on list B and the ab- 
scissa, length of rest; each solid circle 
represents the mean number of correct 
anticipations during the ten trials on 








1020 40 60 


120 


MINUTES BETWEEN WARM-UP 
AND TEST LISTS 


Fic. 1. Mean correct anticipations during 10 trials on the test list following different intervals of 


rest between test and warm-up lists. 


Thune’s data cited. 


Each point represents the mean for a separate experimental 
group according te che condition of rest introduced. 


The point above 1440 min. is taken from 
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list B for a particular group of sub- 
jects (corresponding to the length of 
rest received between lists A and B). 
The value represented by the solid 
circle above the point labelled 1440 
min. (24 hours) on the abscissa was 
obtained from Thune’s data since his 
materials and experimental conditions 
for the determination of this particu- 
lar point were in every respect similar 
to those used in the present study. 
This point, therefore, provides a suit- 
able basis for estimating the decrease 
in learning speed resulting from a 24- 
hour rest interval and serves to indi- 
cate the close approach to the limit of 
effectiveness of the present experi- 
mental conditions. This observation 
indicates that no abrupt change in the 
trend determined under the present 
experimental conditions would appear 
if a full day intervened between lists 
A and B. 

A smooth curve has been drawn by 
inspection through the points in Fig. 1 
(disregarding for the moment the 
deviating results of the 120 min. condi- 
tion). The specific values of the 
parameters of such a curve, whether 
drawn by inspection or fitted by some 
other technique, are not of particular 
importance to the problem at its pres- 
ent stage of development. The main 
point of interest represented by this 
curve is its fairly close correspondence 
with the data and the fact that it re- 
flects a relatively large initial loss and 
negatively accelerated relationship be- 
tween learning decrements and length 
of time since previous formal learning. 
The obvious instances of departure of 
the obtained means from the fitted 
curve probably reflect the relatively 
large variability in the measure of 
performance used. For example, al- 
though the mean for condition em- 
ploying 120 min. rest appears to in- 
dicate a sharp reversal in the trend of 
the data, the differences between this 


mean on list B and the means for ad- 
jacent conditions employing 40, 60, 
180, and 240 min. are not shown to 
be significant on the basis of a t-test, a 
difference of at least 13.20 being re- 
quired for significance at the five per- 
cent level.’ 

The preceding analysis indicates 
that an over-all differential exists 
among the experimental groups fol- 
lowing different lengths of rest inter- 
val. The measure employed (total 
number of correct anticipations dur- 
ing 10 trials on the test list) does not 
indicate where during this learning 
period such differences may be at a 
maximum following their establish- 
ment. Thune (§) presents evidence 
from his experimental situation in 
which a similar differential was estab- 
lished, that the exhibited differences in 
learning speed are maximal during the 
initial portions of the new learning 
and tend to become smaller with addi- 
tional learning trials. A parallel find- 
ing might be expected in the present 
experiment. The same type of analy- 
sis as employed by Thune is used in 
the present instance. For purposes of 
such analysis the learning curves for 
list B were divided into thirds, each 
third including three anticipation 
trials. The average slope within each 
third was measured in terms of the 
mean increase in the number of cor- 
rect anticipations during that third of 
the learning. The results of this 
analysis are shown in Fig. 2. In this 
figure the mean increase in number of 
correct anticipations is plotted against 
each of the three divisions of the learn- 
ing of the test list (trials 1-4, 4-7, and 
7-10). It may be seen that the great- 
est differences among the conditions in 
mean increase in correct anticipations 
during the first third of the new learn- 


* The t for this comparison was based on the 
within-group variance estimate derived in analy- 
sis of variance of these means. 
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INCREASE IN MEAN CORRECT ANTICIPATIONS 
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1-4 


THIRD OF LEARNING TRIALS 


4-7 7-10 


(LIST B) 


Fic. 2. Mean increase in correct anticipations during each third of the learning of the test list for 


the nine experimental conditions. 


The notations for each curve indicate the length of rest separating 


the learning of the warm-up list and the test list for that condition. 


ing; these differences tend to become 
smaller during the second third and 
still smaller during the final third. 
This figure strongly suggests that the 
retarding effect on the new learning of 
the preceding rest interval acts pri- 
marily during the first few trials in the 
learning of the new list. 

The reversals seen among the curves 
during the second and final thirds of 
the learning probably occur because 
learning under conditions exhibiting 
the greatest mean increase during the 
first third tend to approach a ceiling 
of performance beyond which little 
additional gains are possible during 
the final third. Conversely, the con- 
ditions exhibiting the smallest mean 
increase during the initial third tend to 
exhibit correspondingly larger mean 
increases during subsequent thirds as 
they approach nearly the same limit. 


Some of the conditions show greater 


mean increases during the second 


third of the learning than during the 
first third; this is due to the tendency 
towards initial positive acceleration 
during the learning on list B for these 
conditions. 


Discussion 


The experimental operations de- 
fining warm-up effects that are con- 
tained in a general manner within this 
study are somewhat different from 
those gained from the earlier work of 
Bell (2), Ammons (1), and more re- 
cently, Irion (3, 4). These investiga- 
tors were concerned with the relearn- 
ing situation and their experimental 
operations were set in such a frame- 
work. Insofar as they were concerned 
with the warm-up variable the defini- 
tions proposed were in terms of the 
slope of the initial segments of the re- 
learning curves without application to 
more general situations (although 
Irion has stated the possibility). The 
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study by Thune serves to promote a 
more general set of operations applica- 
ble to the problem; e.g., “. .. in 
terms of the slope of the initial seg- 
ment of a corresponding learning or 
relearning curve obtained after a 
different amount of practice and/or a 
different length of retention interval” 
(5). Such redefinition permits the 
identification of warm-up effects in 
the situation involving the learning of 
new verbal materials as well as those 
involving the relearning of previously 
practiced material. Insofar as simi- 
lar results are observed, additional 
support is lent to the validity of find- 
ings in the earlier situations. 

In the present experiment, of course, 
the operations defining warm-up are 
even more generai than those pertain- 
ing to slopes of some particular seg- 
ment of the learning curves. It was 
assumed, simply, that if different 
groups of subjects came to the test list 
following different amounts of rest 


interval since previous formal learn- 
ing, they would do so under conditions 
of different amounts of warm-up to the 
situation (i.e., in some inverse propor- 
tion to the length of the rest intervals 


employed). Since the operation of 
other variables which might affect the 
rate of learning of the test list was re- 
duced to a minimum, residual differ- 
ences among the groups in exhibited 
learning (without confining observa- 
tion to a specific part of the learning 
curve) on the test list should then be 
attributable to differences in state of 
warm-up at the time this learning was 
begun. 

In the first part of the analysis of 
data presented, the differences among 
the experimental groups in terms of 
the total correct responses during the 
learning of the test list were demon- 
strated to be significant. These dif- 
ferences were obtained through em- 
ploying rest intervals within the limits 
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already available from Thune’s data 
(i.e., 24 hours) and serve to indicate 
that losses of warm-up gained during 
learning appear to occur rapidly with 
the cessation of formal practice and 
less rapidly as additional rest ensues. 
Within the conditions of the present 
experiment, such losses appeared to 
have reached their maximum by ap- 
proximately one hour. Some simple 
negatively accelerated function ap- 
pears to be suitable to adequately de- 
scribe this change in time. Repeated 
duplication of the design using a vari- 
ety of learning materials might even- 
tually lead to some consistent ‘best 
fit,’ which could be offered with more 
confidence than the simple inspection 
curve presented, but no rational 
grounds exist a priori to expect one 
particular function to the exclusion 
of others at present. In any event the 
type of function discovered by the 
best-fitting technique would be the 
product of somewhat arbitrary. deci- 
sions as to whether the measure of 
learning differences should be single 
trials, tangential points on a learning 
curve arbitrarily decided upon, or, 
possibly, arbitrarily selected groups of 
trials (as in the present experiment). 
For these reasons, stopping somewhat 
short of a completely exhaustive inter- 
pretation of the data at hand seems 
most consistent with their refinement 
and the demands of the problem. 

In the second part of the preceding 
analysis of data that bear on the 
locus within the learning of the test 
list for the maximum effects of a 
warm-up differential, it was seen that 
the initial trials on the test list showed 
the greatest differences among the 
various conditions. This finding may 
be taken as indicating that the initial 
differences in state of warm-up among 
the groups were lost as practice trials 
on the test list ensued and the groups 
having had the longest rest intervals 
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(suffering, presumably, the greatest 
losses in warm-up adjustments ac- 
quired during previous learning) re- 
gained these adjustments rapidly with 
additional practice on the test list. 
This is consistent with Thune’s find- 
ings on a similar analysis. It should 
be noted that whether the subjects 
begin the test list following a constant 
amount of immediately prior practice 
on the warm-up list and different 
amounts of intervening rest, or differ- 
ent amounts of immediately prior 
practice and a constant rest period,the 
problem at the time of the test list is 
exactly the same—different levels of 
the state of warm-up presumably 
exist for the different groups of sub- 
jects and the locus of maximum differ- 
ences in learning of the test list (re- 
flecting the hypothetical warm-up dif- 
ferential) should be observed to be the 
same; this has been the case with 
these concurrent studies. 

The results of this and related 
studies demonstrate that a factor of 
warm-up can operate under certain 
conditions to ptoduce differences in 
performance in learning situations. 
Two parameters of this factor have 
been investigated so far: (1) its ac- 
quisition as a direct function of massed 
practice on a learning task, and (2) 
its loss following acquisition as an in- 
verse function of time of rest following 
formal practice. Evidence presented 
by Thune further indicates that the 
acquisition of such a state does not 
depend on actual learning taking place 
in the learning situation, but only that 
subjects ‘go through the motions’ 
which are necessary in the actual 
learning. The precise nature of this 
factor is not of primary importance 
except that as it is adequately de- 
scribed, it may be differentiated from 
other factors affecting performance 
and its effect controlled where neces- 
sary. It would be premature to sug- 
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gest the necessity of such explicit con- 
trol in all experimental situations 
where state of warm-up might operate 
in a variable manner. This is for the 
reason that the acquisition of the ap- 
propriate attentive and muscular ad- 
justments termed warm-up may itself 
be learned in such a manner that the 
effects observed in the present experi- 
ment employing naive, relatively un- 
practiced subjects might fail to appear 
were highly practiced subjects used. 
In other words, the crux of the issue 
is the rate at which a warm-up state is 
achieved by the subjects during learn- 
ing. At least under the conditions of 
the present experiment, such a state 
was probably regained slowly enough 
at the time of the test list for differ- 
ences in this process among the condi- 
tions to be reflected by significantly 
different performances. Were the ex- 
perimental design such that subjects 
would achieve a high degree of prac- 
tice prior to completion of the experi- 
ment (as with a counter-balanced de- 
sign, for example, in which each sub- 
ject serves in all the conditions), no 
such differences might appear. This 
possibility offers subject-matter for 
further investigation to gain the de- 
sired answer. 


SUMMARY 


1. The present study was designed 
to ascertain the nature of the relation- 
ship between level of performance on 
a new list of paired-associates and the 
length of interval separating this list 
from a previously learned list when 
practice on the former list is held con- 


stant. It was expected on the basis 
of other studies that the relationship 
between length of interval and the 
new performance should be inverse; 
as the interval of rest is increased, 
performance level on the new list 
should decrease. 

2. Nine intervals of rest ranging 
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from eight sec. to four hours were em- 
ployed to test this relationship. Each 
interval of rest comprised a separate 
experimental condition. Individual 
groups of 25 Ss were utilized in the 
study of each condition; a total of 225 
Ss were employed. The learning 
materials were lists of paired adjec- 
tives or nouns. 

3. Records were obtained of the 
number of correct anticipations during 
ten trials on the new list under these 
conditions. As the period of rest sep- 
arating this list from the preceding list 
was increased, significant decreases in 
performance on the new list were ob- 
served. The observed decrease ap- 
peared to approach a maximum with 
intervals of approximately one hour. 
A negatively accelerated function was 
offered as best describing the observed 
relationship. It was proposed that 


the results of the present experiment 
can best be interpreted in terms of re- 


sponse set and the conditions under 
which it is produced and lost. This 
response set fact has been termed 
warm-up effect. 

4. Analysis of the mean increase in 
correct anticipations during each third 
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of the learning trials on the new list 
indicated that the differences in per- 
formance among the conditions were 
at a maximum during the first few 
learning trials. 

5. It was suggested that the rate at 
which subjects acquire warm-up set 
may itself be modified with practice. 
This possibility is offered for further 
investigation. 


(Manuscript received 
September 5, 1949) 
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ABSENCE OF REMINISCENCE IN THE SERIAL 
ROTE LEARNING OF ADJECTIVES 


BY CLYDE E. NOBLE! 


Tulane University 


INTRODUCTION 


Contrary to the classical obliviscence 
curves of Ebbinghaus (8), several psy- 
chological investigations (1, 10, 11, 28, 
29) dealing with human learning and 
retention have reported characteristic 
initial increases in the recall of partially 
learned materials following periods of no 
practice (cf. also 2,19, 31).2. This class 
of phenomena was designated reminis- 
cence by Ballard (1). The present paper, 
however, is concerned only with the 
short-time increment observed in serial 
verbal learning by the anticipation 
method, known as the Ward-Hovland 
effect. Ward (28) and later Hovland 
(10,11), developed a procedure which 
defined reminiscence as the superior re- 
call performance of an experimental 
group given a non-practice rest period 
over that of a control group without rest. 
Hovland obtained reminiscence follow- 
ing one presentation of a list of 12 non- 
sense syllables and also after learning toa 
ya criterion. Hull and his associates 
(15) have accounted for this phenomenon 
in terms of the unequal relative decay of 
excitatory (£) and inhibitory (/) proc- 
esses and the consequent spontaneous 
recovery of effective reaction potential 
(2) during rest. Their formal prediction 
(Theorem XXXV, Corollary 1) is as 


follows: 


In rote series learned by massed practice to 
the point where some, but not all, of the syl- 
lables are recalled correctly, the mean number 
of syllables susceptible of correct recall will 
increase for a certain length of time after the 
termination of active learning (15, p. 257). 


' Now at the State University of lowa. 

* Similar results have been reported with con- 
ditioned responses by Pavlov (21) and by Calvin 
(6), and with perceptual-motor learning tasks 
(cf. 2), which is more than an analogy according 
to the Lepley hypothesis (16, 17). 


Experimental evidence (10, 11, 15, 28) 
indicates that the optimal postlearning 
rest interval for this effect, in the case of 
nonsense syllables, is approximately two 
min. 

It could be argued that this deduction 
is applicable also to the serial learning of 
‘sense’ or ‘meaningful’ materials but there 
is evidence to the contrary. Deese (7) 
using poetry and the method of complete 
presentation, and Melton and Stone (20) 
using dissyllabic adjectives, 16-item lists, 
and more stringent criteria of mastery 
than those of Ward or of Hovland failed 
to confirm the implicate. Reminiscence 
also failed to appear in the latter investi- 
gation at the 4% level of mastery when 
the exposure rate was increased from a 
two-sec. one to one of 1.45-sec., a condi- 
tion which theoretically would favor its 
occurrence. Melton and Stone (20) and 
Buxton (2) have emphasized the distinc- 
tion between (a) an increment in recall 
following rest and (b) the effectiveness 
of distributed practice upon relearning. 
Melton and Stone found no evidence for 
the former, but there were insignificant 
indications of the latter for the two-sec. 
rate and for the 1.45-sec. rate. 

Earlier, Shipley (25) had reported non- 
reminiscence when lists of 8, 14, and 20 
nonsense syllables were presented at a 
2.3-sec. rate, but his failure to find remi- 
niscence may have been a function of 
several methodological variations upon 
the Ward-Hovland design: (a) his inter- 
list interval was 9.2 sec., (b) presenta- 
tion was by movie projection, (c) his Ss 
read jokes or worked puzzles during rest 
periods, and (d) the criteria of partial 
learning were different. 

Stone (26), using lists of 12 nonsense 
syllables exposed at a two-sec. rate and 
learned to a yg criterion, tested for remi- 
niscence after intervals of 2, 5, and 20 
min. He found no increment in recall 
after two min. but significant decrements 
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after 5 and 20 min. There was, however, 
an insignificant increase in the number of 
Ss showing recall gains after two min- 
utes, with highly significant decreases later. 
Stone’s analysis of the Ss who reminisced 
pertained only to the percentage differ- 
ence between the zero- and the two-min. 
conditions. The writer has computed 
x? values for each of the frequency dis- 
tributions exhibited by Stone under the 
©, 2, 5, and 20 min. delay conditions, 
finding marginal forgetting (P > 0.05), 
no change (P = 1.00), forgetting (P < 
0.01), and forgetting (P < 0.001), respec- 
tively. These tests are the basis of the 
statements given in italics, for which the 
present author assumes responsibility. 
It is pertinent to note here that these ab- 
solute differences as functions of time 
are compatible with Hull’s theory except 
for the non-gains in recall at two min. 
Stone suggested that the high variabil- 
ity of his data lessens the reliability of 
his conclusions. 

Recently, Buxton (3) reported a study 
of both nonsense syllables and adjective 
material, in which 16 items were pre- 
sented at a two-sec. rate with a six-sec. 
interlist interval, but learned to a pre- 
rest mastery criterion. He noted an in- 
significant gain for adjectives after 0.5 
min., then losses after 2, 5, 10, and 20 
min. With the syllables there was a 
significant initial increase up to two min., 
then decreases thereafter, in accordance 
with theoretical expectation. 

Psychologists studying reminiscence 
have found the problem of controlling 
S’s activities between learning and recall 
a vexing one, especially the matter of re- 
hearsal. Ward (28) had his Ss read 
jokes or simply give color-associations 
during the rest period, finding that 
reminiscence was greater under the lat- 
ter conditions. Hovland’s (10,11) Ss 
named colors exposed at a two-sec. rate, 
as in the present study. The Ss of 
Melton and Stone were instructed to 
name colors from cards ‘as fast as possi- 
ble’ (20, p. 296); and Buxton (3) gave his 
Ss ‘light magazines’ to read. It is prob- 
ably impossible to eliminate rehearsal 
entirely. Ten Ss in the present experi- 
ment reported intrusions during rest or 


during the interlist interval, and other 
investigators (4, §, 30) have encountered 
deliberate rehearsal with their Ss. How- 
ever, Buxton and Ross (5) have shown 
that neither the rehearsal variable nor 
that of the learning method (pronuncia- 
tion versus spelling the syllables) is of 
crucial importance in determining rem- 
iniscence. Withey, Buxton, and Elkin 
(30) have found, furthermore, that the 
nature of the color-naming activity does 
not produce significantly different recall 
scores, nor does the rate of naming seem 
to be important. Apparently we must 
search for other variables to account for 
the differences between the Ward-Hov- 
land and the Melton-Stone results. A 
possibly crucial factor is the variation in 
stimulus-meaning occurring in these ex- 
periments (2). 


In order to contribute further to the 
research on this problem the present 
experiment was carried out. The pro- 


cedure corresponds more closely to 
that adopted by Hovland (10, 11) 
than did the studies of Deese, Shipley, 


Melton and Stone, Stone, or of Bux- 
ton, but it does not duplicate Hov- 
land’s conditions. The design is 
comparable in all essential respects to 
those portions of his investigations 
dealing with massed practice effects, 
the principal altered variable being 
the degree of meaningfulness of the 
rote materials. 


PROCEDURE 


Apparatus.—Two memory drums of a modi- 
fied Lipmann synchronous motor type, Tulane 
design, were used to expose the adjective lists. 
The size of the aperture was 2 X j in. and the 
band width was three in. Six bands of adjec- 


* Both Ward (28) and Hovland (10, 11) used 
PE’s as measures of variability then carried out 
tests of significance with the CRpg. According 
to Peters and Van Voorhis (22) this is question- 
able practice and should not be applied to em- 
pirical distributions. Analyses in the present 
experiment are based on small sample theory, 
and all but one (cf. Table I) utilize the formula 
for the significance of differences between the 
means of matched pairs. 
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tives and one band of colors were prepared. The 
three practice and three experimental bands con- 
sisted of lists of 12 trisyllabic adjectives, each list 
beginning with a cue symbol formed by a row of 
asterisks. ‘The words were neither synonymous 
nor antonymous, and none appeared in more 
than one list. They were selected from the 
middle range (4000 to 6618) of Thorndike’s (27) 
list of the 10,000 most frequently used words. 
No word had the same initial letter or syllable in 
any list, and not more than three in a single list 
had the same terminal letter or syllable. Each 
adjective began with a consonant syllable, the 
order in each list being randomized. All possi- 
ble combinations of situation order (see below) 
were used, and the order of lists was assigned to 
these combinations at random. Ss were as- 
signed to the several situations by the same 
method. ‘The adjectives were typed in 14-point 
black capitals. The color band, consisting of 
two consecutive series of 16 different hues occur- 
ring in random order, was prepared by applying 
crayon directly to the paper. All randomization 
was effected by means of the table of random 
numbers given in Lindquist (18). 

The drums were placed side by side on a table 
facing S with adequate illumination from behind. 
E sat to S’s right, where he controlled the ad- 
ministration of stimuli and recorded S’s re- 
sponses. Experimentation was carried out in a 
small room in which auditory distractions were 
minimized. The presentation rate of both ad- 
jectives and of colors was two sec., and the inter- 
list interval was six sec. 

Method.—Twenty-four male undergraduates, 
naive to memory experimentation and uninformed 
of the purpose of the research, were engaged on a 
volunteer basis. ‘They served as Ss in a serial 
rote learning situation using the method of antic- 
ipation with massed practice. Each S reported 
to the laboratory for six consecutive days, of 
which the first three were devoted to practice and 
the last three to experimental sessions. The in- 
structions were modified from Hull et al. (15) and 
included the requirement that S correct errone- 
ous anticipations. All experimental procedures 
were given during the practice sessions, and the 
situation for any given day was one of the follow- 
ing: 1. One presentation of the list, rest pause, re- 
call after two min. Immediately after the last 
word was exposed, the color drum was started 
and S began naming the colors as they appeared. 
After two min. this drum was stopped and S at- 
tempted to recall the words by anticipating them 
on the adjective drum. II. Learning to a cri- 
terion of 7 out of 12 words correct, rest pause, conm- 
tinuation of learning to mastery (defined as one 
perfect recitation). On the trial immediately fol- 
lowing S’s correct anticipation of seven or more 
adjectives, color-naming was begun as in Situ- 
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ationI. Aftertwo min. S resumed learning un- 
til mastery. III. Learning to mastery. In this 
procedure S continued learning by massed prac- 
tice until reaching one perfect recitation. 

The following scoring criteria were adopted: 
a response was judged incorrect if it was preceded 
by two successive identical anticipatory errors, 
or if the adjective was mispronounced or only 
partially enunciated, or if an otherwise correct 
response coincided with the appearance of the 
response-word in the aperture of the memory 
drum. 


RESULTS 


Lack of evidence for reminiscence.— 
Table I presents the results of a com- 
parison of Situation I with Situations 
Il and III. There is no evidence for 
a difference in retention following one 
presentation of the list with and with- 
out the two-min. rest (P > 0.80). The 
analysis of individual Ss showing 
gains or losses with time, in fact, sug- 
gests forgetting since 58 percent of 
them suffered a decrement in reten- 
tion. In order more adequately to 
evaluate these data, McNemar’s x’ 
test of the significance of changes was 
carried out. This procedure tests the 
null hypothesis that equal frequencies 
of Ss will show increments and decre- 
ments in retention (N = 24, df = 1). 


TABLE I 


Retention Fottowinc One Presentation 
or List witH AND without Two-Min. 
Rest BETWEEN PRESENTATION 
AND RECALL 


Group Analysis 


Mean 
No. adj. correctly recalled 
immediately :* 
No. adj. correctly 
recalled after two min.: 


2.14 


1.67 
D = 0.47; t = 0.25; P > 0.80. 
Individual Analysis 


No. Ss showing recall increment: 5 

No. Ss showing no change in recall: 5 

No. Ss showing recall decrement: 14 
x = 3.37; P > 0.05. 


* In this case the data for Situations II and 
III were pooled, necessitating the use of the 
formula for independent samples. 
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TABLE II 


CoMPARABILITY OF EXPERIMENTAL AND 
Controt Groups BErore INTRODUCTION 
or Rest Pause 


Mean 
No. Adj. 
Correctly 

Antic. 


on Crit. 
Trial 
7-58 
7-79 
>.30 


Condition 


Without rest pause (Sit. III) 
With rest pause (Sit. II) 
P from t-test of difference 


After applying Yates’ correction for 
continuity, this distribution was found 
to be of borderline significance (P > 
0.05) in its departure from chance ex- 
pectation. There is, therefore, no 
evidence for reminiscence under this 
condition. 

Table II shows the comparability 
of the two groups in Situations II and 
III before the introduction of the rest 
pause. It will be noticed that, by 
two independent medsures, the groups 
are equated since the differences in 
the mean number of correct anticipa- 
tions on the criterion trial and in the 
mean trials required to reach that cri- 
terion are not significant (P > 0.30, 
P > 0.70, respectively). 

The principal results of the present 
experiment are exhibited in Table III. 
The recall scores (III-A) show a differ- 
ence in the direction of a decrement in 
retention following rest when learning 
has been to the criterion of seven 
words correct, but the difference is not 
significant (P > 0.30). As in Table 
I, forgetting is again suggested by the 
individual analysis, with 54 percent 
of the Ss showing a decrement; but 
the frequency data yield a x? which is 
not significant (P >o0.10). When 
the effects of distributed practice are 
indicated by the number of trials re- 
quired to relearn to the 7/12 criterion 
(III-B), there is no difference (P > 
0.80). Finally (III-C), when the 
trials required to proceed from seven 
correct to mastery are computed for 
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the rest and nonrest conditions the 
difference is not quite significant (P 
> 0.05), although it is in the direction 
expected on the basis of other empiri- 
cal findings (cf. 2, 19, 31). 

In Fig. 1 the composite performance 
curves for the two conditions of the 
experiment are plotted. The correct 
anticipations are shown as a function 
of the number of trials. It is evident 
that the introduction of the two-min. 
rest pause following attainment of a 
7/12 criterion resulted in accelerated 
learning. Both curves assume the 
form of the positive growth function 
deducible from the postulates of 
Hull’s (14) general theory of behavior. 

Serial position effects.—The graphs 
in Figs. 2 and 3 show the type of serial 
position effects deduced by Hull and 


TABLE Il 


RETENTION WITH AND wiTHout Two-Min, 
Rest Pause Fottowinc Learninc To 
Criterion or 7 Out or 12 
Apjectives Correct 
(A) Recall 
Group Analysis 

Mean no. adj. recalled immediately 
after crit. trial: 
Mean no. adj. recalled 2 min. after crit. 


trial: 
D = 0.37; t = 0.95; P > 0.30. 


Individual Analysis 


No. Ss showing recall increment: 
No. Ss showing no change in recall: 
No. Ss showing recall! decrement: 


x? = 1.89; P > 0.10. 
(B) Relearning to 7 Correct 


Mean no. trials required to relearn 
without rest: 

Mean no. trials required to relearn with 
2 min. rest: 


1.46 


1.50 
D = 0,04; t = 0.20; P > 0.80. 


(C) Proceeding From a Criterion of 
7 Correct to Mastery 


Mean no. trials to mastery without 
rest: 

Mean no. trials to mastery with two- 
min. rest: 


9.87 


7.87 
D = 2.00; t = 1.80; P > 0.05. 
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Fic. 1. Performance curves of mean number of repetitions of list requ..ed to anticipate correctly 
varying numbers of adjectives with and without two-min. rest pause (color-naming) introduced after 


reaching criterion of 7 out of 12 correct by massed practice. 


minations. 


his associates (15) and validated by 
Ebbinghaus (8), Ward (28), and Hov- 
land (10,11) for nonsense syllables, 
by Robinson and Brown (24) for three- 
place numbers, and by Melton and 
Stone (20) for dissyllabic adjectives. 
Fig. 2 presents the mean failures of 
correct anticipation as a function of 
the ordinal position of adjectives in 
the lists, when learning has progressed 
to the criterion of seven words correct. 
Fig. 3 shows the mean failures in vari- 
ous serial positions occurring between 
7/12 and 12/12 correct, with and 
without the two-min. rest. This 
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Fic. 2. Serial position effects: mean number 
of failures during learning to criterion of 7 out of 
12 adjectives correct. Data represent values 

from Situation II, each point being the mean of 
" 24 determinations, 


Each point is the mean of 24 deter- 


graph illustrates the advantages in 
error-savings of distributed practice 
in the central portion of the series. 
It confirms Theorem XXI, Corollary 
3 (15), which already has been verified 
for nonsense syllables by Hovland 
(10, 11, 12, 13) and by Patten (23). 
It is to be noted that Melton and 
Stone’s curves (20, Fig. 3) show in- 
creasing serial errors with longer rest 
intervals (2,5, and 20 min.). This 
has been cited (2, 20) as evidence of 
a reversal of the relative reminiscence 
effect predicted for the central posi- 
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Fic. 3. Serial position effects: mean number 
of failures in various positions between criterion 
of seven correct and mastery, with and without 
two-min. rest pause. Each point is the mean of 
24 determinations. 
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tions. This evidence must be quali- 
fied, however, by stating the opera- 
tions of measurement. Melton and 
Stone made their observations on the 
first postrest trial and not, as in the 
present experiment, throughout the 
postrest relearning period. Their 
data do not, therefore, falsify this 
deduction. 

Theorem XXII, Corollary 1, refer- 
ring to the greater mean number of 
failures at the posterior position than 
at the anterior position, is verified by 
the curve shown in Fig. 2. Theorem 
XXII, Corollary 3, however, is prob- 
ably not confirmed, since the curves 
in Fig. 3 fail to show a greater econ- 
omy of distributed practice at the 
posterior position. Theorem XXXI, 
Corollary 3, is not confirmed by this 
experiment. This corollary states 


that the eighth word in the series 
should show the maximum superiority 
of distributed practice over massed 


practice. The data in Fig. 3 indicate, 
to the contrary, that positions 6 and 
9 profit most by the interpolated rest, 
position 8 showing no change. The 
dip in the serial position curve for the 
nonrest condition at positions 7 and 
8 is unaccounted for, although it may 
be an artifact of the heterogeneity of 
the lists. The method used in con- 
structing them may certainly be im- 
proved upon, and a standard proce- 
dure for calibrating meaningful lists 
is needed in research of this type. 


Discussion 


There is no evidence for reminis- 
cence, either following one presenta- 
tion of the list or when learning has 
proceeded to a 7/12 criterion. The 
absolute differences measured by re- 
call scores and by the number of Ss 
showing changes are, moreover, in a 
direction opposite to that found by 
Ward and by Hovland. Similarly, 
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there is no indication of the effective- 
ness of interpolated rest in the case of 
relearning to a criterion of seven words 
correct. But there is an insignificant 
saving in the number of trials required 
to proceed from this criterion to 
mastery following rest. Thus, a di- 
lemma presents itself: why does the 
reminiscence phenomenon not occur 
under conditions regarded as optimal 
(for nonsense syllables) when the 
major condition varied is along the 
dimension of stimulus-meaning? 
Some have pointed out that reminis- 
cence designs have not been ade- 
quately standardized to permit the 
identification of such a critical vari- 
able. The objections raised, e.g., by 
Melton and Stone (20) and by Bux- 
ton (2) in regard to such inter-ex- 
perimental differences as spelling os. 
pronouncing, varied rest interval acti- 
vities, and correction es. non-correc- 
tion are probably now of little sig- 
nificance in the light of subsequent 
research, particularly the work of 
Buxton and his associates (4,5, 30). 
The present design more nearly satis- 
fies the Ward-Hovland criteria for 
producing reminiscence than have 
other recent experiments, and may 
therefore be a more valid test of the 
phenomenon than are those in which 
variables known to be relevant (length 
of list, criteria of partial learning and 
of mastery, practice effects) have not 
been properly controlled. 

The failure of reminiscence to occur 
in the learning of trisyllabic adjectives, 
chosen for their formal similarity to 
nonsense syllables, and under stand- 
ardized conditions with respect to 
number of items, degree of acquisition, 
test-retest controls, presentation rate, 
interlist interval, length of rest, color- 
naming activity, set, learning tech- 
nique, and level of practice is a finding 
of some consequence to learning the- 
ory. These results are also consistent 
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with the results of several analogous 
investigations (3, 7, 20). 

Serial position effects occurring in 
this experiment are in general agree- 
ment with the predictions of Hull et 
al. This constitutes evidence in sup- 
port of their mathematico-deductive 
theory of rote learning with respect 
to the concept of intraserial inhibition 
and the conditions of its removal.‘ 
It suggests further that their theoreti- 
cal system, although designed for non- 
sense material, may with certain 
modifications account for the phe- 
nomena of serial verbal learning, in- 
cluding the use of ‘sense’ lists. The 
writer believes it is possible to account 
for non-reminiscence and obliviscence 
under these conditions in terms of de- 
creased habit interference effects at 
the termination of active learning. 
It may be postulated that the incre- 
ments of interference (A/) are greater 
during the learning of nonsense syl- 
lables than in learning meaningful 
items. This assumption is based on 
the positive empirical relationship 
found by several investigators be- 
tween meaningfulness and rate of 
learning. The studies reviewed by 


*Certain psychologists have contended that 
reminiscence experiments have not conclusively 
demonstrated the dependence of the phenomenon 
upon the dissipation of inhibition. This is to 
question whether reminiscence is consistently as- 
sociated with certain operationally specified in- 
dices of inhibition (e.g., center serial position 
effects). Although Melton and Stone’s discus- 
sion indicated that it is not, it should be pointed 
out that the evidence they reported (20, Fig. 1-A) 
for an increase in intraserial inhibition with in- 
creased rate of presentation was not statistically 
significant. Neither, for that matter, are the 
differences between the two serial position curves 
shown above in Fig. 3 (P > 0.05), yet both the 
Melton-Stone data and those of the present 
author are suggestive of small accumulations of 
habit interference. It is implausible to expect 
the development of J to be an all-or-none affair. 
Hence, one wonders whether Melton and Stone’s 
results are not, in fact, consistent with the theory 
considered in this paper. 


McGeoch (19) suggest that this func- 
tion is linear within definable limits. 
Accordingly, it is supposed as a first 
approximation that the magnitude of 
these serial increments varies inversely 
as some function of the degree of 
meaningfulness (m) of the rote mater- 


ial; i.e., AJ -s(2). A consideration 


of this and certain other relevant vari- 
ables yields a functional equation re- 
garding the growth of J; 


t= s(n,5,%,4), 


where » = number of stimulus items 
in list; s = index of physical similarity 
of stimulus items; m = index of mean- 
ingfulness of items, such that 0.co < 
m < 1.00; and where t = duration of 
inter-item interval in seconds. 

It is thus postulated that familiar 
and meaningful material is easier to 
learn than unfamiliar or meaningless 
material because it has associated with 
it less J. Furthermore, material 
which generates less J from other 
sources—such as n, s, and t—will also 
be less difficult to learn. With the 
cessation of practice and serial rein- 
forcement, the value of effective exci- 
tatory potential (Z), where E = E — 
I, will decrease steadily with time 
rather than undergo an initial increase. 
This state of affairs is then observed 
behaviorally as unaltered recall or as 
forgetting depending, other things con- 
stant, upon the empirical value as- 
signed to m.® Such a deduction is in 
conformity both with the present re- 
sults and with the theoretical deriva- 
tion of the Ward-Hovland phenom- 
enon by the Yale group. The present 
point of view simply regards reminis- 
cence as a special case of a more gen- 
eral principle of retention. 

* Available evidence indicates that under 


standard conditions reminiscence will appear 
optimally when the value of m < 0.30. 
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The postulate set designed by Gibson 
(9), based on the concepts of conditioned 
generalization and differentiation, may 
account for additional mechanisms un- 
derlying these retention phenomena. 
She regards such interference effects as 
those found in nonsense lists as the result 
of intraserial generalization. Differ- 
ences in recall or in work-savings are as- 
sumed to be functions of the extent of 
initial stimulus generalization. Further, 
generalization tends toward a minimum 
for meaningful items and toward a maxi- 
mum for nonsense items. When it is 
observed, therefore, that more trials are 
required to learn nonsense syllables, the 
explanation is in terms of the difficulty 
of establishing conditioned differentia- 
tion among the several items. In other 
words, when meaning is defined as the 
differentiating characteristics of stimuli 
one from another, ease of learning is pro- 
portional to degree of meaningfulness. 
This definition of stimulus-meaning is 
useful but somewhat equivocal, since 
intralist similarity-difference is a dimen- 
sion logically distinct from that of mean- 
ing-nonmeaning. Thus, material may 
be highly meaningful and acoustically 
similar (e.g., noun homophones), or re- 
latively meaningless and visually similar 
(e.g., homomorphous nonsense syllables 
of low association value), yet the be- 
havioral effects of both would, in Gib- 
son’s view, presumably be referable to 
the same set of explanatory principles. 
In view of the conceptual independence of 
these dimensions, therefore, it seems rea- 
sonable to use an association value as the 
index of stimulus-meaning (m) and to 
define intralist similarity (s) in terms of 
the physical (formal) characteristics of 
the stimuli (e.g., by a psychometric 
scaling method). Accordingly, the func- 
tional equation for J above has included 
expressions to denote the effects of vari- 
ation along each of these dimensions. 


Certain alterations seem to be re- 
quired in Hull’s system if it is to ac- 
count for serial verbal learning phe- 
nomena other than those for which it 
was designed. The changes suggested 
here are only tentative; more precise 


formulations await further empirical 
data. 
SUMMARY 


1. Twenty-four practiced Ss, 
equated as to pre-rest level of ac- 
quisition, learned lists of 12 trisyllabic 
adjectives by the anticipation method 
at a two-sec. presentation rate with a 
six-sec. interlist intervah Recall and 
relearning data computed with and 
without a two-min. rest interval of 
paced color-naming were analyzed for 
reminiscence effects. 

2. There was no evidence for remi- 
niscence following a single presenta- 
tion of the list. An analysis of the 
performances of individual Ss indi- 
cated forgetting of marginal signifi- 
cance. 

3. The introduction of a two-min. 
rest following learning to a criterion of 
seven words correct resulted in no 
difference in the mean number of 
words correctly anticipated on the 
first postrest trial, nor did it yield a 
work-saving in relearning to the same 
criterion. This was the crucial test 
for reminiscence in the present experi- 
ment. There was, however, a saving 
of borderline significance in the mean 
number of trials required to continue 
learning to mastery from that criterion 
following rest. 

4. The effect of interpolated rest, 
considered graphically in composite 
curves, was to reduce the negative ac- 
celeration of the performance growth 
function. The error-saving, in terms 
of serial position effects, appeared 
most clearly in the central and post- 
central portions of the lists. 

5. The present findings with respect 
to serial position effects are in partial 
conformity with those of Ward and of 
Hovland, but they fail to confirm their 
reminiscence effects as defined by dif- 
ferential recall. The data are in gen- 
eral agreement with those of Melton 
and Stone and of Buxton. 
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6. These results seem adequately 
accounted for in terms of the differ- 
ential decay of inhibitory and excita- 
tory processes, according to the the- 
oretical analysis of Hull etal. Certain 
modifications seem necessary in their 
system, however, if it is to mediate 
the deduction of serial verbal learning 
phenomena using meaningful items as 
well as nonsense syllables. 

7. An alternative postulate is pro- 
posed for the growth of J which util- 
izes the mean association value, m, of 
the material as an index of stimulus- 
meaning. It is suggested that this is 
a critical variable operating in recent 
experiments on reminiscence. 


(Manuscript received 
September 14, 1949) 
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TRANSFER OF VERBAL TRAINING TO A MOTOR TASK! 


BY KATHERINE E. BAKER AND RUTH C. WYLIE 
Connecticut College 


INTRODUCTION 


Several investigations in the field of 
transfer of training have shown positive 
transfer from so-called mental or ‘imagin- 
ary’ practice to a number of motor tasks. 
Sackett (5,6) found that relearning a 
maze was facilitated by either drawing 
the pattern or thinking through the maze, 
with the drawing type of rehearsal hav- 
ing the greater transfer value. In an- 
other maze learning experiment (8), the 
subjects who reported having depended 
primarily upon explicit or implicit ver- 
balizations learned more efficiently than 
those who said that they had depended 
primarily upon visual imagery or a motor 
mode of attack. Another study (4) 


compared the efficiency of actual and 
‘imaginary’ practice on five motor tasks, 
ranging from those demanding simple 


coordination to those requiring ideational 
processes. Both types of practice were 
found to be effective, with the imaginary 
practice having its maximum effective- 
ness for those tasks involving ideational 
activity. In an experiment investigat- 
ing the relative values of 18 days of 
physical and mental practice upon dart 
throwing and basketball shooting (7), it 
was found that mental practice was al- 
most as effective as actual practice in 
improving the motor skill. 

Although these experiments show 
amazingly large amounts of transfer re- 
sulting from mental practice, none of them 
provides a basis for analyzing just what 
kinds of mental practice are occurring 
and what the theoretical explanation of 
their transfer value might be. Neither 
the nature of the task nor the experi- 
mental design makes it possible to deter- 
mine the relative roles of verbalization, 


''The money used to pay for the services of 
experimenters was supplied under Contract 
N7onr-316, Task Order I, between Special De- 
vices Center, Office of Naval Research, and Con- 
necticut College. 


visual imagery, or implicit motor re- 
sponses. In some types of motor tasks, 
the major difficulty is encountered in 
learning the relationship between signal- 
ling stimuli and required responses, e.g., 
card sorting, or complex choice reaction 
time tasks. In such tasks, we can sur- 
mise that verbalizations, and perhaps 
visual imagery, should be the most ef- 
fective kinds of mental practice. In 
other tasks, such stimulus-response rela- 
tionships may be few and simple, and the 
difficulty of the task lies in learning to 
associate kinesthetic cues from precise 
body movements with the visual or other 
stimuli which signal that the movement 
has led to correct results, e.g., the correct 
ball-basket relationship is easy to learn, 
but it is difficult to discover which kin- 
esthetic cues will be followed by the 
sight of the ball dropping through the 
hoop. In this sort of task it is probable 
that implicit verbalizations would be of 
little help (except perhaps in reviewing 
general facts about posture, arm posi- 
tions, etc.) and that the most effective 
mental practice would consist of implicit 
motor responses. In terms of this analy- 
sis, many of the tasks used in the experi- 
ments cited above were of such a nature 
that both verbalization and implicit 
motor practice might have been used 
with some transfer value, thus preventing 
one from determining the relative effec- 
tiveness of either type of rehearsal. In 
addition, the design of the experiments 
involved initial test trials on the actual 
tasks which were later to be practiced 
mentally. Thus, the subject had an op- 
portunity to observe the response to be 
made, and might have practiced them 
implicitly during the mental rehearsal 
interval. In such cases, even if verbal- 
ization was the chief type of mental 
practice used, its effect would not be 
clearly separable as such. 

Another factor which has not been 
adequately controlled in previous ex- 
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periments is the amount of mental re- 
hearsal given. When attempts are being 


made to compare types of practice, it 
would be desirable to have an exact and 
known number of trials given to each 
group of subjects. 


It was the purpose of the present 
experiment to control these factors by 
an experimental design which would 
markedly reduce the possibility of 
motor practice occurring simultane- 
ously with verbal practice, and would 
definitely determine the number of 
training trials. In the training task, 
the subject is asked to memorize, by 
the paired associates method, verbal 
expressions of the stimulus-response 
relationships which he will later en- 
counter in a final motor task. The 
final task was a discrimination learn- 
ing problem, involving four lights, 
each of which could be turned off by a 
given switch. This task seemed to be 
one of those in which the most difficult 
associations lent themselves to verbal 
expressions, so that positive transfer 
from verbal practice might reasonably 
be expected here. The possibility of 
the occurrence of implicit motor prac- 
tice was further controlled by giving 
the verbal training without the sub- 
ject’s knowing that any motor task 
was to follow. 

Besides exploring the transfer value 
of verbal training, we also propose to 
attempt an explanation of the present 
results in terms of the theoretical con- 
cepts which have emerged from other 
transfer experiments, using different 
types of training before this motor 
task. 

MetHop 


The motor task set up for these experiments 
requires that the subject hit with his hand one of 
four toggle switches in response to red or green 
signal lights which may appear in either of two 
positions on a vertical panel. The subject 
starts each response from a constant central 
point and attempts to turn out the presented 
signal light as quickly as possible by hitting the 
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proper switch on a horizontal panel. Both time 
of correct response and errors may be used as 
measures of learning. 

The motor task used here is identical with 
one which has been employed previously in the 
study of several other aspects of the problem of 
transfer of training to motor skills. The ap- 
paratus is described in detail elsewhere (1). 

One group of subjects, a control group, was 
given no verbal training, but was merely told 
which one of the switches should be hit when 
each of the four stimulus lights appeared. This 
group was then given 60 presentations of the 
stimulus lights in random order, with the in- 
structions to try to hit the correct switch as 
quickly as possible after the signal light ap- 
peared. Before each presentation of a stimulus 
light, the subject was given a ready signal which 
served as a two-sec. warning of the appearance 
of the light and as a signal to press down with 
the thumb on a central starting button. An 
electric timer started with the presentation of a 
stimulus light and stopped when the subject hit 
the correct switch. The subject was instructed 
that if the first switch he tried did not turn out 
the stimulus light, he was to rectify his error by 
hitting the correct switch as quickly as possible, 
since the timer continued to run until the correct 
response was made. ‘The time of response was 
recorded for each trial, and the subject was told 
his time of response as well as being allowed to 
read the clock for himself. In addition, the 
nurnber of errors was recorded for each trial, but 
the subject was not informed that this was being 
done. 

Two additional groups of subjects were given 
eight and 24 verbal training trials, respectively. 
This verbal training involved words which ex- 
pressed the stimulus-response relationships oc- 
curring in the motor task. These subjects were 
informed that they were going to be asked to 
learn verbally certain relationships which would 
be used in a later part of the experiment. (No 
mention was made that the experiment would 
ever involve a motor task.) They were told that 
top-red went with right-one, top-green with right- 
two, bottom-red with left-one and bottom-green 
with /eft-two. During the training series, they 
were asked to say as quickly as possible the sec- 
ond pair of words which went with any given 
first pair spoken by the experimenter. The 
pairs of words presented to the subject were 
given in random order and he was required to 
continue responding until he gave a correct 
answer. In order to motivate the subject for 
speed the experimenter pretended to time the 
subject by a stop watch, 

Following the verbal training, the subject was 
taken into the experimental room and shown the 
motor skill apparatus for the first time. It was 
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TABLE I 


Transrer in Teams or Response Time Fottowinc Varyinc 
Amounts or Versa TRAINING 


Means and standard deviations of average response time per trial for successive 10-trial stages in 
the learning of a motor task are presented for three groups of 18 subjects, each group having had a 
different amount of verbal training prior to learning the motor task. 
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58.57 
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pointed out to him that the words presented to 
him in the training series represented the color 
and position of the lights that would appear on 
the stimulus panel of the apparatus; and that 
the pairs of response words which he had learned 
represented the position and number of the 
switches he was to hit in response to these stim- 
ulus lights. The subject was then given 60 
trials on the motor task itself, with response time 
and errors recorded for each trial. 

Each group consisted of 18 female college 
students. The groups were matched on the 
basis of two criteria: a digit-symbol test with 
an 80-sec. time limit, and a reaction time meas- 
ure based on 25 presentations of one of the given 
lights. ‘These two measures had previously been 
found to give a multiple correlation of .42 with 
the average response time for 60 trials on this 
motor task, and a multiple correlation of .35 with 
number of errors made in 60 trials on the task 


(1)? 
ReEsuULTsS 


Table I shows the results obtained 
for the three groups in terms of re- 
sponse time. This table presents the 
means and standard deviations of re- 
sponse time per trial for each 10-trial 


2On the reaction time test, means for the 
three groups ranged from .5622 to .5866 sec., 
while standard deviations ranged from .0649 to 
.0762. On the digit symbol test, means ranged 
from 69.00 to 69.39, while standard deviations 
ranged from 9.83 to 10.95. 





stage of learning, and Fig. 1 presents 
these means graphically. It may be 
seen that, for all three groups, average 
response time per trial is high at first, 
drops rather rapidly: with additional 
trials, and, after a more gradual drop, 
appears to approach a constant level. 
The transfer effects of verbal training 
may be seen by comparing the curves 
obtained by the experimental groups 
with that of the control group. Eight 
trials of verbal training apparently 
produced no transfer effect, since the 
curve is practically identical with the 
control group curve. Differences sig- 
nificant at the .o5 level or better ap- 
pear only during: trials 31-40 and 
51-60. However, the average re- 
sponse time for the group given 24 
verbal training trials is significantly 
lower than that of the control group 
except during trials 31-40; and is sig- 
nificantly lower than that of the 8-trial 
group except during trials 51-60.' 

* The critical ratios used here have been com- 
puted by a method which rules out inter-trial 
variance by computing the standard errors of 


the differences between the means of each of the 
10 trials within each stage, using NV = 10. 





TRANSFER OF VERBAL TRAINING 


— © TRIAL GROUP 
—-<-<-— § TRIAL GROUP 
TRIAL. GROUP 


TIME (0.01 SEC) 


= =) rr) 400 4F80s«C-00 


TRIALS 


Fic. 1. Learning curves in terms of mean response times per trial for successive 10-trial stages of 


learning of a motor task. 


Each curve was obtained for a group of 18 subjects, each group having had 


a different amount of verbal training prior to learning the motor task. 


Table II shows the results obtained 
for the three groups in terms of errors. 
Means and standard deviations are 
presented for the number of errors 
made during each 10-trial stage of 
learning. Fig. 2 is a plot of the data 
of Table II. In general, the two ex- 
perimental groups make fewer errors 
than does the control group, the great- 


est discrepancy appearing during the 
beginning stages of learning the motor 
task. For 10-trial stages these dif- 
ferences are not significant, probably 
because only a few errors were made, 
and the distributions of these errors 
were skewed, resulting in large stand- 
ard deviations. However, when we 
consider half of the learning trials at a 


TABLE II 


TRANSFER IN Teams or Numpea or Errors Mave Fottowinc 
Varyinc Amounts oF VerBAL TRAINING 


Means and standard deviations of the errors made during each 10-trial stage and during each half 
of the learning of a motor task are presented for three groups of 18 subjects, each group having had a 
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MEAN NUMBER OF ERRORS PER 10 TRIALS 
a 


TRIALS 


Fic. 2. Learning curves in terms of mean numbers of errors made during successive 10-trial 
stages in the learning of a motor task. Each curve was obtained for a group of 18 subjects, each group 
having had a different number of trials of verbal training prior to learning the motor task. 


time, a critical ratio of 2.16 is obtained 
for the difference between average 
number of errors made in the first 
30 learning trials by the 24-trial group 
and the control group. Thus, 24 
trials of verbal training provide some 
measurable degree of positive transfer 
to the motor task in terms of errors as 
well as time. The positive transfer 
effect, however, cannot be shown to 
continue after the first 30 trials of 
learning. 
Discussion 


This experiment has been concerned 
with the effects of verbal practice upon 
the subsequent learning of a motor 
task. No transfer effect was found 
following small amounts of training in 
verbalizing the stimulus-response re- 
lationships of the motor task; but 
larger amounts of such verbal train- 
ing were of benefit to the later learning 
of this task. In the present experi- 
ment, transfer was obtained when the 
rehearsal was presumably exclusively 
verbal. The particular experimental 
design which was used allowed for 


restricting the amount of implicit 
motor practice during training, and 
the motor task investigated was one 
which would probably benefit by ver- 
bal training. We do not wish to leave 
the impression that our results and 
conclusions may be considered to 
apply to other kinds of tasks in 
which it would be more difficult to 
formulate appropriate verbalizations. 
As was noted in the Introduction, it is 
difficult to see how certain types of 
tasks, such as basketball shooting, 
could be expected to benefit much by 
verbal training. 

Positive transfer resulted here when 
both the stimuli and responses of the 
training task differed from those of the 
final task. A previous experiment 
(2) has shown comparable results for 
another situation in which training 
and final tasks involve both different 
stimuli and responses. In this previ- 
ous experiment, positive transfer re- 
sulted from practice on a pictured 
representation of the apparatus to a 
motor task very similar to that of the 
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present experiment. The panel con- 
taining the light stimuli was repre- 
sented by four printed circles, one of 
which was colored red or green on any 
given sheet; and the response con- 
sisted of making a pencil mark in one 
of the four printed squares represent- 
ing the positions of the switches. 

An hypothesis making use of the 
concepts of stimulus and response 
generalization was suggested to ex- 
plain the results of the pictured repre- 
sentation experiment. 

In the first part of the hypothesis, 
it was assumed that training on pic- 
tured representation served to: (a) 
strengthen generalized pencil-marking 
response tendencies to the similar 
light stimuli of the motor task, (b) 
strengthen response tendencies similar 
to the pencil-marking response, e.g., 
switch-pressing responses to pictured 
representation stimuli and to similar 
stimuli. The first of these assump- 
tions requires the mechanism of stim- 
ulus generalization, while the second 
makes use of response generalization. 

These same assumptions may be 
applied to the present experiment if 
we make the added assumption that 
generalization may occur along medi- 
ated dimensions. This additional as- 
sumption provides for strengthening of 
a generalized response tendency to a 
particular light stimulus by training 
which establishes this response tend- 
ency to a verbal symbol for the light 
stimulus. Such generalization, of 
course, depends upon previous learn- 
ing in which words such as ‘red’ or 
‘top’ are associated with a particular 
color or position so that the words 
have become functionally equivalent 
to the visual stimuli. Similarly, by 
the mechanism of mediated general- 
ization, a verbal response may 
strengthen the motor response which 
it symbolizes. 

The second part of the hypothesis 
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presented previously concerns the 
fact that the motor task used required 
the learning of discriminative habits. 
In such discriminative learning any 
training which serves to reduce the 
internal generalization within the 
final task (i.e., the tendency to con- 
fuse the stimuli and responses within 
that task) may be expected to result 
in positive transfer. In the present 
experiment as well as the previous 
one training may be considered not 
only to have strengthened similar re- 
sponse tendencies to similar stimuli, 
but also to have reduced the range of 
the generalization gradients involved. 
If such a reduction in the range of 
generalization has occurred, the type 
of training given may have served 
also to increase the discriminability 
of the stimuli and responses of the 
motor task. 


There is no way of telling from these 
experiments whether both stimulus and 
response generalization are necessarily 
important aspects of the explanation. 
However, it is reasonable to suppose that 
reduction of internal generalization 
among stimuli would be particularly im- 
portant in a task where discrimination 
among visual stimuli plays such a major 
role. With an apparatus similar to the 
one used here, an experiment was per- 
formed to explore the transfer effects of 
pre-differentiation among signalling stim- 
uli (3). Positive transfer resulted when 
training consisted of presenting the actual 
task stimuli and requiring verbal re- 
sponses completely unrelated to the 
actual task responses (e.g., the letters V, 
L, M, or N were to be given in response 
to the lights). Thus, in a situation pro- 
viding opportunity to reduce internal 
generalization among final task stimuli 
but not among final task responses, con- 
siderable positive transfer occurred. 
This would suggest that at least part of 
the explanation of the positive transfer 
effects of verbal training lies in the op- 
portunity for reduction of stimulus gen- 
eralization gradients. It is interesting 
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to note that in both the stimulus pre- 
differentiation experiment and the pres- 
ent experiment, small amounts of train- 
ing had no transfer value, whereas the 
larger amounts did benefit the learning 
of the final task. 


SUMMARY 


This experiment investigates the 
transfer effects of varying amounts of 
verbal training upon the subsequent 
learning of a motor task. Previous 
experiments showing positive transfer 
from mental practice to the perform- 
ance of motor tasks have not allowed 
for determination of the relative roles 
of implicit motor responses or verbal- 
izations. By designing this experi- 
ment so as to minimize the possibility 
of using implicit motor responses, we 
were able to study the effectiveness of 
a primarily verbal type of mental re- 
hearsal. The motor task used in the 
experiment was a discrimination learn- 
ing problem in which the subject 
learned to press the appropriate one 
of four switches upon the appearance 
of a red or green stimulus light in 
either of two positions. In the verbal 
training the subject was asked to 
memorize, by an oral paired associ- 
ates method, verbal expressions of the 
stimulus-response relationships of the 
final task. Specifically, the experi- 
menter gave two words representing 
color and position of light, and the 
subject was to respond with two other 
words representing the position of the 
correct switch. During verbal train- 
ing the subject did not know that a 
motor task was to follow. Three 
groups of 18 subjects received o, 8 and 
24 verbal training trials, respectively, 
prior to 60 trials on the motor task. 


1. When time was used as the meas- 
ure of learning, no significant transfer 
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effects were found following eight ver- 
bal training trials; but 24 verbal train- 
ing trials did yield a significant amount 
of positive transfer to the perform- 
ance of the motor task. 

2. Considering the total number of 
errors made during the first 30 trials 
on the motor task, we found no trans- 
fer from eight verbal training trials, 
but a significant amount of transfer 
from 24 verbal training trials. 

3. The present results were inter- 
preted in terms of the theoretical 
concepts of stimulus and response 
generalization. 


(Manuscript received August 19, 1949) 
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SHIFTING OF DISTRIBUTION OF PRACTICE 
IN MIRROR DRAWING! 


BY J. C. TSAO 


Hong Kong Unwersity 


INTRODUCTION 


In testing the deduction from 
Snoddy’s (2) theory, that, since 
‘‘primary growth comes early and 
is enhanced by interpolated time 
.. « (and) secondary growth comes 
later and is enhanced by reducing 
the interpolated time,” early spaced 
and later massed practice should be 
more efficient than early massed and 
later spaced trials, Doré and Hilgard 
(1) found that the final achievement 
in pursuit learning of an early concen- 
trated and later distributed practice 
group was significantly higher than 
that of the group with the reversed 
distribution of trials. In a reply to 
Doré and Hilgard, Snoddy (3) main- 
tained that the early spaced and later 
massed practice group was handi- 
capped by interference effects which 
occurred because the S “twas first 
adapted to time-interval conditions 
and then shifted to continuous prac- 
tice conditions in the same practice 
sitting’ and further predicted that 
“had these authors (Doré and Hil- 
gard) removed the interference by 
interpolating an over-night interval 
and then measured the relatively per- 
manent efficiency immediately follow- 
ing the interval they would find their 
two groups at the same efficiency 
level.” 

The keynote of Snoddy’s contention 
was that interferences were due to 
shifting from one practice condition to 


1 This is the seventh article in a series of 
studies in spaced and massed learning. The 
study was carried out in the Psychological 
Laboratory, Cambridge, England; the writer 
is indebted to Mr. G. C. Grindley for help and 
advice. 


another in the same sitting, and that 
the interference effects could be re- 
moved by interpolating an over- 
night interval. According to Snoddy, 
therefore, if the changing-over were 
not in the same sitting, and a 24-hour 
interval were to be interpolated be- 
tween the practice periods, the results 
should be differerent from those ob- 
tained by Doré and Hilgard. In 
order to test this deduction, the pres- 
ent experiment was carried out.” 


EXPERIMENT 


The experiment of the present study was on 
mirror drawing, the setup of which was described 
in the first paper of this series (4). A brief re- 
capitulation should suffice. ‘The apparatus used 
was shown in the former paper; in general it was 
slightly different from the ordinary star-pattern 
tracing sheet. The learning score was 1/12 of 
the actual time, in terms of sec., taken by the S 
in each trial of tracing, which corresponded to 
the mean tire required in tracing from the 
centre of the apparatus to one of the periperhal 
circles and back to the centre. 

The Ss were 64 R.A.F. air-crew who formed 
four groups of 16 people each, which were desig- 
nated Groups E, F,G, and H. Groups E and F 
practised 12 trials in one sitting, and Groups G 
and H in two sittings with an interval of 24 hours 
interpolated between the 6th and 7th trials. 
In Groups E and G, trials 1-6 were spaced with 
one-min. intervals and trials 7-12 were continu- 
ous; in Groups F and H the procedure was re- 
versed, i.c., trials 1-6 were continuous and trials 
7-12 spaced. 


RESULTS 


The average scores of each group 
are given in Table I and they are pre- 
sented also in a graphic form in Fig. 1. 
Since the scores are time-records and 


? When this paper was going to be sent to the 
editorial office, the writer read the article of Cook 
and Hilgard (J. exp. Psychol., 1949, 39, 169- 
172), which is on the same line as the present one. 
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TABLE I 
Averace Scores or Mirror Drawinc 














I “ss 3 4 





16.4 | 10.6 | 7.3 
16.9 | 10.3 | 8. 
16.1 9.2 | 

16.1 | 14.5 








consequently denote the reciprocals 
of the speed of tracing, a larger figure 
in the table or a higher point on the 
curves means a lower efficiency. 
From the scores and the learning 
curves, it can be seen that at the com- 
pletion of the 6th trials the groups 
having early spaced practice, i.e., E 
and G, were superior to those having 
early massed practice, F and H. The 
differences between Groups E and F 
and between G and H in the scores of 
the 6th trials are significant statisti- 
cally, as shown in the following: 


6th trials of Groups E and F: Diff. 
= 6.5-4.6 = 1.9; C.R. = 4.60. 

6th trials of Groups G and H: Diff. 
= 6.7-4.3 = 2.4;C.R. = 4.20. 


With regard to final achievements, 
groups which had early massed and 
later spaced practice were slightly 
superior to groups which had early 








Fic. 1. Learning curves of mirror drawing 


spaced and later massed practice. 
The final score of Group F was a little 
higher than that of Group E, and those 
of Groups G and H were at about the 
same level. 


12th trials of Groupe f E and F: Diff. 
=3.9-3.2 = 0.7; C.R. = 1.90. 

12th trials of Groups G and H: Diff. 
= 3.5-3.4 = 0.1; C.R.=0.44. 


The gain in efficiency or drop in 
score from the 6th trial to the 7th 
trial was in Group E 0.3, in Group F 
1.0, in Group G 0.1, and in Group H 
0.9. It should be borne in mind that 
the performance of the 7th trial after 
the 6th in Groups E and G involved a 
shifting from spaced to massed prac- 
tice, and in Groups F and H a shifting 
from massed to spaced practice, and 
that the shifting in Groups E and F 
took place in the same sitting, while 
in Groups G and H an over-night 
interval was interpolated. The re- 
sults seem, therefore, to indicate that 
larger gains were made when the 
shifting was from massed to spaced 
than when from spaced to massed 
practice (1.0 and 0.9 vs. 0.3 and 0.1); 
and that it made little difference 
whether the shifting took place in the 
same sitting or after an overnight 
interval (0.3 and 1.0 vs. 0.1 and 0.9). 


Discussion 


The results of Groups E and F in 
the present experiment agreed with 
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those obtained by Doré and Hilgard, 
which showed that early spaced and 
later massed practice was not so 
efficient as early massed and later 
spaced practice. The shifting from 
one to another form of distribution of 
practice in Groups G and H did not 
take place in the same sitting, and 
though the final achievements of 
these two groups were at about the 
same level the results were not funda- 
mentally different from those of 


Groups E and F in which the shifting 
did occur in the same sitting. 


When Snoddy made the statement that 
had Doré and Hilgard interpolated an 
over-night interval and measured the re- 
latively permanent efficiency immedi- 
ately after the interval “they would find 
their two groups at the same efficiency 
level,” he might mean that if Doré and 
Hilgard measured again the efficiency on 
the next day after the experiment they 
would find that the early spaced and 
later massed group had caught up with 
the other one. It was a pity that in the 
present study we could not test Groups 
E and F in this way because of some dif- 
ficulty in arranging the programme. 
There are some reasons, however, to 
doubt the effects which Snoddy attrib- 
uted to an over-night interval. Firstly, 
the results of the present study showed 
that 24-hour intervals in Groups G and 
H did not cause greater gains from the 
6th to 7th trials than the corresponding 
improvements in Groups E and F which 
did not have such long pauses. Sec- 
ondly, Snoddy cited the results of an 
early experiment of his (2) as the evidence 
for the interference effects due to trans- 
ference from one practice condition to 
another in the same sitting. In his ex- 
periment, two over-night intervals were 
interpolated, after the second of which 
there were losses in efficiency in both of 
the paired repetitions; and after the first 
24-hour interval there was a loss in the 
first trial and not in the second, though 
Snoddy thought that there should be a 
loss in the second trial as well. These 
results may indicate that it is unlikely 
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that the early spaced and later massed 
group could catch up with the rival 
group should they be tested the second 
day in the experiment of Doré and Hil- 
gard. 

According to Snoddy’s theory, early 
massing is extremely unfavourable, be- 
cause on the one hand it does not provide 
enough time intervals for primary growth 
to develop and on the other hand it may 
facilitate secondary growth which can- 
not, however, be enhanced in an early 
period and without an appropriate base, 
i.e., a certain amount of primary growth. 
Late spacing is also disadvantageous 
since the unstable secondary growth may 
fade away during the later intervals. 
Yet Doré and Hilgard found that early 
massed and later spaced was more effici- 
ent than early spaced and later massed 
practice, and the results of Groups E and 
F in the present study rather confirm 
their findings. When the conditions 
proposed by Snoddy were fulfilled in 
Groups G and H of the present study, 
the early spaced and later massed group 
still failed to show its superiority over 
the early massed and later spaced group. 
Snoddy attributed the lower efficiency 
of early spaced and later massed group in 
the experiment of Doré and Hilgard to 
interference effects and took the results 
of one of his early experiments as an 
evidence. In his early study, however, 
Snoddy interpreted the interference ef- 
fects as due to the dominance of time 
intervals interpolated before paired trials. 
It is very difficult to apply the same 
interpretation to the results of Doré and 
Hilgard since the conditions in the two 
studies were quite different. Further- 
more, time intervals in the early spaced 
practice could not dominate during the 
whole course of the later massed prac- 
tice; if it did so, secondary growth could 
never take place. 

Since, in his reply to Doré and Hil- 
gard, Snoddy stressed the effects caused 
by the fact that the S was first adapted 
to time-interval and then shifted to con- 
tinuous practice condition in the same 
practice sitting, it seems that he might 
mean that the interferences were due at 
least partly to the disturbance of the S's 
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adaptation to the practice conditions. 
This is a psychological interpretation and 
is not congruent with his theory in gen- 
eral, according to which “earliness en- 
hances primary growth, and lateness and 
repetition alone, the secondary form.” 
Moreover, the results of Groups G and 
H of the present study made this inter- 
pretation, were it Snoddy's, unlikely to 
be tenable. 


SuMMARY AND CONCLUSIONS 


Four Groups, E, F, G, and H, of 16 
Ss each practised on mirror drawing 
for 12 trials. In Groups E and G, 
trials 1-6 were spaced with one-min, 
intervals and trials 7-12 were contin- 
uous; in Groups F and H, the proce- 
dure was reversed. Groups E and F 
performed the 12 trials in one sitting, 
and Groups G and H each had an over- 
night interval interpolated between 
the 6th and 7th trials. 

In final achievements, Groups G 
and H were about the same, and 
Group F slightly exceeded Group E. 
The results of Groups E and F agreed 
with those of Doré and Hilgard, and 
the results of Groups G and H failed 
to support Snoddy’s supposition. 
The results naturally lead us to the 
conclusions that early massed and 
later spaced practice may be as ef- 
ficient as, or even more efficient than, 
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early spaced and later massed practice 
in mirror drawing, and that Snoddy’s 
hypothesis regarding the interferences 
between primary and_ secondary 
growths is questionable. The latter 
conclusion may throw some doubts 
on the whole system of Snoddy’s the- 
ory. It may be true that time inter- 
vals and practice enhance different 
aspects or processes of learning, which 
may be designated as primary and 
secondary growths or by any other 
names, but it is doubtful whether 
these different aspects or processes 
in learning follow the “rule of earli- 
ness and lateness” and are “distinctly 
opposite in character’ and “always 
related to each other in terms of plus 
and minus,” as assumed by Snoddy. 


(Manuscript received 
September 15, 1949) 
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THE RELATIONSHIP BETWEEN ORDER AND FREQUENCY 
OF OCCURRENCE OF RESTRICTED 
ASSOCIATIVE RESPONSES 
BY W. A. BOUSFIELD AND W. D. BARCLAY! 


University of Connecticut 


The purpose of this study was to 
test the following hypothesis: for a 
group of Ss having a generally similar 
cultural background, the order of occur- 
rence of restricted associative responses 
from the individuals should vary posi- 
tively with the frequencies of occurrence 
of the responses from the group. In 
other words, the most commonly given 
responses should occur early in indi- 
vidual sequences and those of lower 
frequencies should occur later. 

The term restricted associative re- 
sponses was employed by Bousfield 
and Sedgewick (1) to denote sequences 
of verbal associates given by Ss when 
their instructions stipulated the rele- 
vance of the responses. Sample in- 
structions were: to “name as many 
quadruped mammals as you can”; to 
“name as many cities in the United 
States as you can”; and to “name as 
many pleasant objects, activities and 
situations as you can.” Analysis of 
the responses indicated a negatively 
accelerated rate of production which 
the writers interpreted as being a func- 
tion of the number of remaining avail- 
able associates. In describing this 
type of situation we may postulate 
that the S, in each case, draws upon a 
pool of available relevant associates. 
As habits, these associates have been 
subjected to various degrees and types 
of reinforcement, and hence differ in 
strength. The reinforcements of ver- 
bal responses are largely social in char- 
acter. Given a group with generally 


! The writers would like to acknowledge their 
indebtedness to Dr. Charles E. Osgood for sug- 
gestions leading to the design of this experiment. 


similar cultura! background, we would 
expect a degree of similarity with re- 
spect to factors reinforcing verbal 
habits. Certain associative responses 
would be more extensively reinforced 
than others and hence more likely to 
occur. As Hull (2) has indicated, 
probability of occurrence is an index 
of habit strength. There are also 
grounds for assuming that habit 
strength should determine the order 
in which the S draws upon a pool of 
associative responses. This assump- 
tion may be derived as a corollary of 
the principle that probability of oc- 
currence is an index of habit strength. 
The S beginning a sequence of relevant 
responses would be most likely to give 
those having greatest strength. As 
the sequence progresses, the supply of 
stronger habits becomes reduced. 
The conditions of the experiment in- 
hibit renetition. The S must then 
draw upon those associates of weaker 
strength, and here again relative 
strength would determine probability 
of occurrence. The S thus tends to 
give his strongest associates first and 
his weakest last. 

Assuming the validity of this analy- 
sis, both frequency and order of oc- 
currence are related indices of habit 
strength, and for a group of Ss whose 
verbal habits have received more or 
less similar reinforcement by virtue 
of a common culture, these indices 
should correlate positively. We may 
further assume that the requirement 
of cultural similarity is adequately 
met by using as Ss, native undergradu- 
ate students (mostly with middle class 
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and lower middle class backgrounds) 
attending their State University. 


PROCEDURE 


The procedure for obtaining data was similar 
to that employed by Bousfield and Sedgewick 
(1). Undergraduate students in psychology 
courses served as Ss. The experiment was con- 
ducted in a large class room regularly used for 
the meetings of the courses. Each S was pro- 
vided with a pencil and an 8§ X 11 in. piece of 
plain paper. £, provided with a stop watch, 
gave the following instructions, supplementing 
them whenever it appeared necessary by ex- 
planatory comments. 


1. You are asked not to write your name. 
Please indicate your age and sex, however, on 
the paper that has been given you. 

2. This experiment is being undertaken in 
order to find out something about a certain 
type of thinking. It will involve a problem 
similar to the following: write all the first 
names you can think of beginning with the 
letter 4. Thus you might give Arthur, Alfred, 
Alice, Abraham, Andrew and soon. You are 
not being asked to work on this problem, but 
if you were, you would proceed by writing the 
names in a column at the left of the paper. At 
intervals I would say, “Draw a line.” On 
hearing this signal, you would draw a short 
horizontal line below the last name you had 
written, and then continue with more names. 
[EZ illustrates the procedure on the black- 
board.] In the event you had thought of no 
new names since the last instruction, “Draw 
a line,” you would draw another line just the 
same. [E again illustrates the procedure on 
the blackboard. ] Other possible instructions 
might be to name animals, trees, or as many 
parts of an automobile as possible. You are 
to begin writing as soon as you hear the word 
“Go!” Are there any questions? 


Having answered such questions as appeared 
relevant, E continued with specific instructions 
for the production of associates. A total of 18 
min. was allowed for the writing, the signals for 
drawing demarcation lines being given at inter- 
vals of two min. The three types of instruction 
were as follows: 

a. For Group I (100 Ss): List the names of as 
many birds as you can. Ready, go! 

b. For Group II (58 Ss): List the names of as 
many carpenter’s tools as you can. Ready, go! 

c. For Group III (60 Ss): List the names of as 
many celestial (that is, heavenly) bodies as you 
can. Ready, go! 
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ReEsutts 


The first step in the treatment of 
data was to obtain the frequencies of 
the various associates. Table I pre- 
sents the 50 most frequently occurring 
associates foreach group. In making 
the frequency counts no distinction 
was made between singular and plural 
forms of associates, i.e., the lexical 
unit was employed. Misspellings 
were ignored. Inspection of the lists 
revealed a considerable proportion of 
what might be termed mediately rele- 
vant associates. These were most 
pronounced in the low frequency 
categories. In the lists of birds, there 
were, for example, 4 katydids, 3 duck- 
bill platypuses, 2 gazelles, and 2 jail 
birds. The associates of lower fre- 
quency in Table I indicate consider- 
able flexibility in classification. 

The next step in the treatment of 
data was to transcribe the frequencies 
onto the original individual lists. 
The successive associates of these 
lists were numbered 7, 2, 3, 4, 5, .. . 
m to indicate order of occurrence. 
These numbers are here referred to as 
the ranks of the associates. The fre- 
quencies and ranks of all associates 
for a group of Ss were then recorded 
on a scatter diagram using intervals 
of five for both frequency and rank. 
From this diagram the mean rank was 
computed for each interval of fre- 
quency. The data thus condensed 
appear in Table II and Fig.1. It may 
be noted that there are gaps in the 
data for several of the frequency inter- 
vals. The data indicate, however, a 
pronounced tendency for associates of 
high frequencies to occur relatively 
early (to have low ranks), and for 
those of progressively lower frequen- 
cies to occur progressively later (to 
have higher ranks) in the three types 
of associates. The variability of the 





ORDER AND FREQUENCY OF ASSOCIATIVE RESPONSES 


TABLE I 


50 Most Freqventiy Occurainc Associates ror Eacu Group 








Group I (100 Ss) Birds 


Group II (58 Ss) Carpenter's Tools 


Group III (60 Ss) Celestial Bodies 





Associate 





Blue Bird 
Eagle 
Sparrow 
Canary 
Humming Bird 
Blue Jay 
Black Bird 
Parrot 
Hawk 
Pheasant 
Pigeon 
Woodpecker 
Owl 


Oriole 
Duck 

Sea Gull 
Starling 
Swallow 
Chicken 
Dove 
Goose 
Ostrich 
Wren 
Chickadee 
Turkey 
Thrush 
Peacock 
Kingfisher 
Penguin 
Albatross 
Cat Bird 
Quail 


Whippoorwill 

Crane 

Mocking Bird 
Nightingale 

Red Wing Black Bird 
Raven 

Bob White 

Heron 

Grouse 








Hammer 
Screwdriver 
Saw 
Chisel 
Nails 
Rule(er) 
Plane 
Pencil 
Drill 
Screws 
Pliers 
Sandpaper 
File 


Putty Knife 
Knife 

Saw Horse 
Chalk 

Nuts 

Paint 
Compass 
fis Saw 


aint Brush 


Hatchet 
T-Square 
Blue Prints 
Nail Set 
Miter Box 
Plumb Line 
Protractor 
Rope 

Band Saw 
Brush 
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North Star 
Little Dipper 
Milky Way 
Mercury 

Stars 

Neptune 
Halley’s Comet 
Comet 

Orion 
Southern Cross 
Juno 
Capricorn 
Sagittarius 
Uranus 


Meteor 

Planets 
Taurus 

Cancer 
Morning Star 
Aries 

Big Bear 
Cassipocia 
Clouds 

Pisces 

Scorpio 

Zenith 

Apollo 

Aurora Borealis 
Evening Star 
Gemini 

Little Bear 
Northern Cross 
Northern Lights 
Pleiades 

Vir 

Alpha Centauri 
Betelgues 
Castor 

Eastern Star 
Pollux 





9 
8 
8 
7 
7 
6 
6 
5 
5 
5 
5 
5 
4 
4 
4 
4 
4 
4 
4 
4 
4 
3 
3 
3 
3 
3 





data would appear to be due in part associates, and the 60 Ss listing celes- 
to the sizes of the samples. For the tial bodies averaged 17.32. Obvi- 
lists of birds, 100 Ss had a mean of ously other factors contributed to 
35-97 associates per S. The 58 Ss_ variability and further study would be 
listing carpenter’s tools averaged 24.60 necessary to isolate them. 
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TABLE II 


Frequency ano Orper or Occurrence (Rank) or Associates 
in Sequences or Associates 








Group II (58 Ss) Group III (60 Ss) 

» | Carpenter's Tools Celestial Bodies 
Frequency of 

Occurrence to ee ~ 

| 





Mean Rank No. Mean Rank No. Mean Rank 





29.0 
25.5 
25.8 
26.2 
23.7 
24-4 
22.9 
24.7 
23.2 
22.0 
19.7 
21.0 
16.5 
18.3 











Responses 














a 


CELESTIAL 
BODIES 








26-30 8I-86 
FREQUENCY OF OCCURRENCE 


lic. 1. Relationship between frequency and order of occurrence (rank) of 
associates. Gaps in the data are indicated by broken lines. 
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CoNCLUSIONS 


Assuming the validity of the experi- 
mental design and the treatment of 
data, we reach at least two conclu- 
sions: 


1. The data support the experi- 
mental hypothesis which was form- 
ulated as follows: for a group of Ss 
having a generally similar cultural back- 
ground, the order of occurrence of re- 


stricted associative responses from the 
individuals should vary positively with 
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the frequencies of occurrence of the re- 
sponses from the group. 

2. The data indicate the feasibility 
of the analysis of sequences of re- 
stricted associative responses from the 
standpoint of learning theory. 


(Manuscript received August 30, 1949) 
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RETENTION OF PURSUIT ROTOR SKILL AFTER ONE YEAR 


BY HUGH M. BELL 
Department of Psychology, Chico State College 


The retention of motor skill has been 
studied by numerous experimenters 
since the early work of Book and 
Swift on typewriting in which a high 
degree of retention was found even 
after a lapse of two years. A sum- 
mary of significant studies on long- 
time retention in motor skills has 
been made by McGeoch (2). 

A study (1) of the effect of rest 
pauses of varying lengths interpolated 
early and late in learning on the ac- 
quisition of pursuit rotor skill has been 
reported by the writer. In this in- 
vestigation 457 subjects were given 
20 trials of one min. each separated by 
a one-min. rest except where longer 


rests were introduced. For the early 
rest groups, rest periods of 10 min., 
one hour, six hours, 24 hours, and 30 
hours were used after the fifth trial, 
and for the late rest groups, rest peri- 
ods of the same duration were em- 
ployed following the fifteenth trial. 
The control group used in this study 
rested one min. after each trial. 

The present experiment consists of 
a retest after one year of 47 of the 
subjects who participated in the first 
experiment. Each subject was given 
20 one-min. trials on the pursuit rotor 
separated by rest periods of one min. 
each. It was not possible to obtain 


enough subjects from any one of the 
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Fic. 1. Retention of pursuit rotor skill after one year. 


April 1940. 





20 «(2k 25 » 5) 


Left: Trials 1-20, 


Right: Trials 21-40, April 1941 
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RETENTION AFTER ONE YEAR 


original groups and hence the scores 
in this experiment consist of some indi- 
viduals from each of the groups. 


RESsULTs 


In Fig. 1 the initial learning curve 
of the 47 subjects for trials 1 through 
20 is shown. This curve is very simi- 
lar to the curve of the control group 
from the original experiment in that it 
shows an initial rapid rise followed by 
a gradual leveling-off as the final 
trial is approached. 

Shown also in Fig. 1 is the retention 
curve of these same subjects after one 
year represented by trials 21 through 
40. At the 21st trial (the first after 
the intervening year) the average score 
dropped to the level of trial 9 of the 
original series, but the next three trials 
show a very rapid recovery of the 
skill so that by the 28th trial the group 
has regained all that was lost over the 
one-year interval. From the 29th 
through the goth trials the group con- 
tinued to show a slight improvement 
in their performance on the rotor. 

Analysis of the scores of these sub- 
jects indicates that they were some- 
what superior to the control group 
used in the first experiment as their 
mean score at trial one was 71.9 com- 
pared with an average of 51.4 for the 
control group. This superiority was 
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maintained throughout the 20 trials 
of the original experiment and at trial 
20 their average was 479.7, 23 points 
above that of the control group. 

After the year interval, the initial 
trial of the present experiment, trial 
21, shows a drop to 340.4, a loss of 
139.3, Or 29 percent. By trial 28 the 
mean was 479.4, the point reached by 
the group at the final trial of the 
original learning series. From trials 
29 through 40 the group gained an 
additional 49 points, reaching an 
average score of 529.9 at the final 
trial. 

ConcLusIons 


Over the one-year period following 
20 one-min. trials scores dropped 
only 29 percent, and this was com- 
pletely recovered after eight trials. 
Slight gains continued during the re- 
maining trials. The long-lasting re- 
sults of a brief practice period are 
convincingly demonstrated for this 
motor skill. 


(Manuscript received October 3, 1949) 
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SOME EFFECTS OF SHAPE ON APPARENT BRIGHTNESS * 


BY R. M. HANES 
The Johns Hopkins University, Institute for Cooperative Research 


INTRODUCTION 


While there is considerable contra- 
dictory evidence, numerous studies 
have produced results which indicate 
that the threshold for recognition of 
a visual test object is influenced to 
some extent by the shape of the ob- 
ject. Kleitman and Blier (5), for 
example, found that triangles, circles, 
squares, and stars of approximately 
the same area can be discriminated in 
the order given. For binocular dis- 


crimination of two objects presented 
simultaneously Whitmer (7) found 
that the rank of different shapes based 
on the percent of correct discrimina- 
tions was as follows: triangle best, 
then diamond, square, rectangle, cir- 
cle and hexagon. 


King, Landis, and 
Zubin (4), using only squares, tri- 
angles, and circles, found subliminal 
perception (forced guesses) to exceed 
chance. While the results for the 
various figures did not differ signifi- 
cantly, the triangle did show up as 
somewhat better than the other two. 
Believing that the conflicting reports 
of previous investigators were due to 
lack of consideration for area of figure 
and specific framework effects, Hoch- 
berg, Gleitman, and Macbride (3) 
attempted to maintain better control 
over these factors. In this case 
threshold for form was found to 
increase with decreasing simplicity 
or compactness, as defined by the 
ratio of perimeter to area. The cir- 
cle was found to have the lowest 


* This research was carried out under Con- 
tract N5-ori-166, Task Order I, between Special 
Devices Center, Office of Naval Research, and 
The Johns Hopkins University. This is Report 
No. 166-I-109, Project Designation No. NR-784- 
oo1 under that contract. 


threshold, the square next, and the 
cross the highest. Langstroth et al. 
(6), on the other hand, report that 
there has been no difference found in 
the probability of detection for square, 
rectangular, triangular, and disc- 
shaped targets when target area is 
constant. 

While this brief review is not ex- 
haustive, it serves to indicate that 
limited agreement has been found. 
Triangles, perhaps, have been most 
persistent in giving lower thresholds. 
Whether threshold results, even if 
consistent, should prevail above 
threshold is open to question; the 
threshold experiments are cited pri- 
marily as being suggestive. But if 
‘compactness’ is a factor in detection 
at threshold, it might also help to 
determine apparent brightness. If 
apparent size is a determiner of 
threshold for form, it might also 
be a factor in apparent brightness. 
The present study was undertaken in 
an attempt to answer some such possi- 
bilities. 

APPARATUS 

Two luminous opal glass screens, presented 
simultaneously, served as the stimuli. The light 
source for each was a 150-watt projection bulb 
housed in a metal box. The light from each bulb 
passed successively through a first sheet of opal 
glass, a diamond-shaped diaphragm, and a sec- 
ond sheet of opal glass, this latter glass being the 
one viewed by the observer. The different stim- 
ulus shapes and sizes were obtained by mounting 
photographic slides on the opal glass. Each 
shape and size was reduced photographically 
from an enlarged (ten to one) original. Circles, 
triangles, and squares were the forms used. 
Each was made in three different areas: 0.0031, 
0.0123, and 0.7854 square in. These three areas 
were determined from circles of pg, $, and one in. 
diameters, which at the viewing distance of 24 
in. subtended visual angles of about 9, 18, and 
144 min., respectively. The }-in. size was used 
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to make certain that shape would be clearly rec- 
ognized even at the higher brightness levels; the 
Ys-in. offered a slight possibility of confusion. 

The two stimuli were located about 4} in. 
apart at the end of a rectangular (34 by 8 in.) box 
lined with black velvet, whose average reflect- 
ance was less than one percent. This box was 
attached to a large metal bowl at the end op- 
posite the stimulus discs; thus, the O could see 
only the two luminous stimuli surrounded by a 
dark field. Brightness calibration was made 
with the use of a Macbeth I!luminometer. 

The brightness of each stimulus could be 
varied continuously either by O in front of the 
apparatus or by E at the rear. Wratten neutral 
density filters were used for gross changes in 
intensity level. 


ProcepuRE 


O was seated comfortably in front of the ap- 
paratus upon entering the dark experimental 
room and given several practice trials to acquaint 
him with the nature of the task. He was then 
presented with two stimuli of either the same or 
different shape but of the same area and told to 
adjust whichever one served as variable until its 
brightness was equal to that of the standard. 
No time limit was imposed on the trials. Each 
shape was compared with itself and both other 
shapes of the same area, as both standard and 
variable, on both the right and left, at each of 
three brightness levels: 0.1, 10, and 100 mL. 
There were thus 162 comparisons for each O. 
The stimuli were extinguished between compari- 
sons while a new pair was being set. 


OsseRVERS 


Five male college students, between 20 and 
25 years of age, served as Os. All had normal 
visual acuity or were corrected for defects. 
None had any experience in psychological 
experiments. 


REsuULTs 


The principal data are presented in 
Table I, where the results for five ob- 


servers have been averaged. The 
table is to be read as follows: When 
the standard was a triangle having 
an area of 0.0031 sq. in. and a lumi- 
nance of o.1 mL., the mean value 
for an equal-sized square as variable 
which appeared equally bright was 
0.125 mL., but for a triangle the mean 
value was only 0.095 mL. The tri- 
angle, therefore, appeared brighter 
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than the square. The means of the 
rows, along the right side of the table, 
show the average values required for 
each indicated shape to match a com- 
posite standard at each indicated lu- 
minance. The column means in each 
section of the table represent the aver- 
age values required to match the indi- 
cated standards. The row means 
reveal that, when an average is taken 
over all sizes, the square always ap- 
pears least bright and the triangle and 
circle generally do not differ very 
much. The column means indicate, 
however, that while the same relation- 
ship holds true in the majority of 
cases, the shape which appears bright- 
est is somewhat dependent upon the 
size and brightness level. 

The settings for each observer under 
each of the various conditions were 
expressed as percentages of the stand- 
ard and analyzed by means of the 
analysis of variance. This analysis 
revealed that the over-all differences 
mentioned above among the three 
shapes were significant beyond the 0.1 
percent level of confidence. In other 
words, the square was significantly 
less bright than the triangle and the 
circle. A t-test showed that the dif- 
ference between the apparent bright- 
ness of the triangle and the circle was 
not significant. The differences due 
to brightness level were also found to 
be highly significant; again a t-test 
showed that this effect was due pri- 
marily to the difference between the 
o.1 mL. level and the othertwo. This 
same type of change in constant error 
in this kind of test situation, has been 
noted before (1,2). The over-all 
differences among Os were not signifi- 
cant, and the main effects due to the 
different sizes were barely significant 
(about the five percent level of con- 
fidence). 

The first-order interactions between 
Os and both size and brightness were 
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TABLE I 
Retationsnip Berween Apparent Bricutness anv SuHape ror Turee Sizes 
or Stimuut at Turee Bricutness Levers 


Each entry in the body of the table is the mean value in millilamberts for five Os 
which appeared equal in brightness to the indicated standard. 








Standard 





Shape Square 


Triangle | 





Area (9q. in.) O31 | 123 ¢ 


| 
0031 | 0123 | -7854 0031 
' ' | 





Variable 





0.1 mL. level 





Square 
Triangle 
Circle 





Mean 


Square 
Triangle 
Circle 








Mean 





125 
.100 
.123 




















Square 
Triangle 
Circle 
Mean 


93 
94 
66 


significant, however. In other words, 
different Os varied in their matches 
depending upon the particular size and 
brightness involved. On the other 
hand, the interaction between Os and 
shape was only barely significant at 
the five percent level of confidence. 
The interaction between size and 
shape was, however, highly significant, 
indicating that the noted differences 
among the column means in Table I, 
as a function of size, are not chance 
variations. In fact, the sums of 
squares for the interaction of size 
and shape were larger than any of 
the other sums. Which shape ap- 
pears brightest, therefore, is largely 


102 


124 
11S 








114 





dependent upon the size involved in 
the comparison. Nevertheless, as in- 
dicated by the column means in Table 
I, only in one of the comparisons does 
the square show up as brighter than 
the triangle. The superiority in ap- 
parent brightness for the circle as 
compared with the square is more 
equivocal. 

Table II is presented for the purpose 
of allowing for a more readily discern- 
ible picture of the effects of shape on 
apparent brightness. In this table 
each entry is the mean of the three 
size means for each standard and 
variable at each brightness level. 
The greater apparent brightness of 
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TABLE Il 


Rerationsuie Between AppaRENT 
Bricutness AND SHAPE AT THREE 
Bricutness Leve.s 
Each entry in the body of the table is the 
mean value in millilamberts of five Os 
which appeared equal in brightness 
to the indicated standard. 








Standard 





Variable | Square 


Triangle | Circle 
' 





0.1 mL. level 





.106 121 





10 mL. level 





Square 
Triangle 
Circle 


Mean 








100 mL. level 


102 | 104 
Triangle 100 104 
Circle 


Square 











the triangle and circle as compared 
with the square is more clearly shown 
here by both the row and column 
means. There is no significant inter- 
action between rows and columns, 
thus indicating that the same relation- 
ships for shape and apparent bright- 
ness are found regardless of which 
shape is used as the standard. 


Discussion 


It seems quite clear that the form of 
a luminous stimulus can influence the 
apparent brightness of that stimulus. 
And while no one shape (of the three 
studied here) has been found to appear 
brighter than the others under all 
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conditions tested, nevertheless the 
triangle, and probably also the circle, 
definitely appears brighter on the 
average than does the square. If any 
superiority in threshold sensitivity 
exists for form, it is probably the 
triangle which has been shown most 
often to give the lowest thresholds. 
On this point, at least, there would 
seem to be some agreement between 
threshold and supra-threshold effects. 

Why should these differences in ap- 
parent brightness occur? Since area 
remains the same for the three shapes, 
explanation for the effects must be 
based on differences in perimeters, 
maximum dimensions or apparent 
size. It seems that all three pos- 
sibilities must be invoked to account 
for the main effects and interactions 
found here. The enhanced brightness 
of the triangle at the two smaller sizes 
could be accounted for by the longer 
dimensions of the triangle relative to 
the square and circle. If border con- 
trast were a primary factor in deter- 
mining apparent brightness, and het 
total extent of a given contrast also 
influenced apparent brightness, then 
the triangle should appear brightest. 
That the triangle does not also appear 
brightest when the largest size is con- 
sidered would make this hypothesis 
questionable except that a further 
factor may become operative at this 
size. It was determined in a previ- 
ous experiment that apparent bright- 
ness increased with increasing size at 
low intensity levels (0.1 mL.) but that 
this relationship was in general re- 
versed at higher levels (100 mL.). 
Now with the two smaller sizes studied 
in the present experiment there is a 
negligible difference in apparent size 
among the three shapes. When the 
largest size is considered, however, the 
triangle looks considerably larger than 
the other two. Therefore, with the 
large size and higher intensity levels, 
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the triangle, since it looks larger, might 
well have its apparent brightness di- 
minished. The data substantiate, to 
a degree at least, such a hypothesis. 
The fact that the corners of the tri- 
angles would extend farther into the 
periphery than would the edges of 
either the square or circle, and would 
therefore stimulate more rods and less 
cones, could also cause the apparent 
brightness of the triangle to be reduced 
relative to the other figures at higher 
intensity levels. 

These explanations do not satis- 
factorily allow for the marked differ- 
ence between the circle and the other 
two at the o.1 mL. level and largest 
size. No satisfactory explanation for 
this discrepancy seems apparent. It 
is obvious that these various relation- 
ships are quite complex and that no 
single explanation is adequate for the 
influence of form on either thresholds 
or apparent brightness. 


SUMMARY 


The influence of shape or form on 
apparent brightness has been investi- 


gated for three geometrical forms 
(square, triangle, and circle) at three 
intensity levels (0.1, 10, and 100 mL.). 
Comparisons were made only among 
forms of equal area. 

The triangle and circle, on the aver- 
age, exhibited an enhanced brightness 
relative to the square. These differ- 
ences do not, however, remain the 
same for all sizes and all brightness 
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levels. In other words, there are some 
significant interactions among the 
three factors. When the two smaller 
sizes are considered the triangle has a 
consistently higher apparent bright- 
ness than do the other two. For the 
largest size the circle*appears to be 
brightest. 

While the results can be partially 
explained in terms of marginal con- 
trast and apparent size, no simple 
single explanation seems adequate to 
account for the results. 


(Manuscript received for immediate 
publication July 7, 1950) 
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ACCURACY OF VISUAL INTERPOLATION BETWEEN SCALE 
MARKERS AS A FUNCTION OF THE NUMBER ASSIGNED 
TO THE SCALE INTERVAL! 


BY A. CHAPANIS AND M. LEYZOREK * 
Department of Psychology, The Johns Hopkins University 


INTRODUCTION 


During the past several thousand 
years, mathematicians and logicians 
have built up a considerable body of 
information about the formal properties 
of numbers and their uses. In spite of 
this, very little is known about the psy- 
chological problems involved in the han- 
dling of numbers when they are attached 
to tools of everyday life such as scales of 
length or weight. That there are some 
genuine psychological problems here can 
be inferred. from a few studies already 
published. There is now clear evidence, 
for example, that most people have 
marked number preferences and use 
some numbers more commonly than 
others. Thus, if simple numbers are 
attached to marks on a ruler and people 
are asked to interpolate the final digits 
of a series of measurements, the distribu- 
tion of final digits does not come out as 
one would predict. Even when the 
digits from o to 9 have equal probability 
of occurrence, too many 0's, 2’s, 5’s and 


1 This investigation was performed under 
Contract N5-ori-166, Task Order I, between the 
Special Devices Center, Office of Naval Research, 
and The Johns Hopkins University. This is 
Report No. 166-I-107, Project Designation No. 
NR-784-001, under that contract. A _ pre- 
liminary account of this experiment was reported 
by Chapanis (2) at the 1947 Meetings of the 
American Psychological Association. 

* Despite the large number of machines and 
men involved and the complex procedures nec- 
essary, this experiment was completed smoothly 
and efficiently. Credit for this accomplishment 
is due in large measure to the diligent and coop- 
erative personnel of the Field Laboratory and 
the Naval Training Facility (Air Intercept) 
Beavertail who assisted in the work. We deeply 
appreciate the help given us by both the Naval 
Officers and Radarmen of the Facility and the 
civilian technicians and engineers of the Field 
Laboratory. 


8’s are selected and the digits 1, 3, 4, 6, 
7, and 9 are largely avoided (8). 

f somewhat greater interest are the 
results of a war-time report by Vernon 
(5) who was concerned with some factors 
contributing to ease of scale and dial 
reading. Among other things, she varied 
the number of divisions between num- 
bered markers as well as the numbers 
assigned to certain of the markers. Her 
results show that both factors have a 
strong influence on the ease and precision 
with which the scales could be read. 
Scales in which the smallest unit was 
equal to 1, 2, or § were read more accu- 
rately than scales in which the smallest 
unit was equal to 4, 2.5, or 8. Vernon 
tested only a few possible combinations of 
these factors so that her results are not as 
general as we might like. 

A related study by Bartlett ef a/. (1) 
investigated the precision of visual inter- 
polation in a polar-coordinate plot as a 
function of the number assigned to the 
scale interval. Only three number inter- 
vals (5, 10, and 20) were used, but the 
results show an unmistakable difference 
between the accuracies attainable with 
the three scales. 


PuRPOSE OF THE PRESENT 
EXPERIMENT 


The experiment reported here ex- 


plores somewhat more thoroughly 
than has been done heretofore the re- 
lationship between accuracy of visual 
interpolation and the number assigned 
to the scale in which estimations are 
made. 


APPARATUS 


Four standard naval radar indicators were 
used as experimental equipment. These indi- 
cators were of the plan position type, now widely 
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Fic. 1. A schematic diagram of the polar coordinate display showing the concentric circular scale 


markers, the points whose positions are to be estimated, and the circumferential angular scale. 


interpolation is attempted along any radius. 
with the 2,000-yard interval. 
7,100 yards. 


Visual 


The circular scale markers are numbered here for a trial 
Thus, there is a point at about bearing 046° and at a distance of about 
The brightness relations are reversed here; actually the field is dark and the scale 


markers and estimation points are bright (see the text for a fuller explanation). 


used as aids in navigation on the sea and in the 
alr? Although somewhat unorthodox for psy- 
chological experimentation, these machines lend 
themselves well to the presentation of a variety 
of visual problems. Since the details of the de- 
sign and construction of the radar indicators are 
of no consequence to this report, the equipment 
will be described only in the most general terms, 
but the psychological characteristics of the prob- 
lem as presented to the subject will be discussed 
in detail, 


The Visual Display 
The face of a cathode-ray tube (CRT), ap- 


proximately 10 in. in diameter, was the field on 
which the subject’s visual problem was presented. 
The subject saw a pattern of four thin,.phos- 
phorescent, equally-spaced concentric rings on an 
unsaturated, dark, gray-green field. The rings 
served as circular scale markers. Interspersed 


* For a more detailed and technical description 
of the type of equipment used in this experiment 
consult recent articles by Williams et al. (6, 7). 





among the scale markers and concentric with 
them were several short, bright, phosphorescent 
arcs (see Fig. 1). ‘These were the points to be 
located by visual interpolation between the 
appropriate scale markers. 

The points and the concentric scale markers 
were ‘painted in’ on the surface of the CRT in the 
wake of a thin, bright, phosphorescent radial line 
which rotated about the center of the circular 
field at six rpm. This entire display was the 
visible evidence of a pattern of excitation im- 
posed on the phosphor coating of the CRT face 
by a beam of electrons rapidly scanning the sur- 
face of the tube from the center to the periphery 
and back while rotating about the center. (So 
fast was the scanning rate of this beam of elec- 
trons that it was seen, for the most part, as a 
single radial line rotating about the end of the 
line located at the center of the CRT.) 

The basic color of the pattern of concentric 
circles and short arcs was bright blue, but it varied 
somewhat in saturation with the intensity of the 
excitation imposed on the phosphor screen and 
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the time after excitation. At the sweep line, the 
pattern was brightest, decaying with increasing 
distance (and time) behind the sweep to a yellow- 
green and finally to the ground color of the circu- 
lar field. In the interval between successive 
sweeps (10 sec.) the brightness of the pattern 
diminished slowly, but the scale markers and the 
points between thern remained visible through- 
out each rotation cycle of the sweep line. With 
each rotation of the sweep the pattern was simply 
renewed in brightness. 

The circular scale markers were approxi- 
mately one mm. thick. Measured along any 
radius of the display, the points used for localiza- 
tion varied from approximately two to three mm. 
in thickness and were six degrees wide. Five 
points were presented in the circular field at one 
time. Because of electronic limitations, no 
points were ever presented in the innermost area 
of the display. 

Together with its accessory equipment and 
necessary circuits, the CRT was mounted in an 
enclosure about 36 in. high and 17 in. square. 
The horizontal viewing surface of the tube was 
visible through a protective cover glass in an 
opening in the top of the enclosure. Between 
the cover glass and the face of the tube were 
mounted an azimuth scale laid off in degrees 
around the edge of the tube face (see Fig. 1). 


Arrangement of the Apparatus 


Two identical cathode ray tube units of the 
kind described above were used to enable two 
subjects to work at the same time. For con- 
venience these will now be called test units. 
These units were linked in a circuit with two 
high-precision display units equipped with de- 











Fic. 2. A schematic diagram of the arrange- 
ment of equipment and personnel used in this 
study. Shown here are the test units (TU), cri- 
terion units (CU), signal generator (SG), opera- 
tors (O), subjects (S), monitors (M), and experi- 
menter (E). The solid lines show linkages be- 
tween equipment; the dotted lines show tele- 
phone connections between the experimenter, 
operators, and monitors, 
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vices for accurately determining the true position 
of each localization point in the circular test field. 
These precision units furnished the criterion in- 
formation for determining the actual position of 
each point—hence, these will be called criterion 
units. All four display units (two test, two cri- 
terion) were supplied simultaneously with the 
same scale and point pattern by a complex signal 
generator (SG) of special design. A block dia- 
gram of the circuit connecting these units is 
shown in Fig. 2. The four display units were 
isolated from each other to prevent the subjects 
and the criterion unit operators from hearing 
each other’s reports on the position of any point. 
Because of this equipment isolation, a telephone 
network was used to link the personnel working 
at the various stations to a central point at the 
SG without allowing them to hear each other. 
The subjects were not provided with phones. 


PROCEDURES 


After setting in a given pattern the experi- 
menter notified the criterion unit operators and 
the monitors standing by at the test units that a 
pattern was ready for use. The criterion unit 
operators immediately began their work of re- 
porting to the experimenter the exact position of 
each point in the array, always identifying the 
point by its angular location in the field. The 
experimenter recorded the data thus furnished 
hirn by each criterion operator and maintained a 
running check on the agreement between the two 
equipments and their operators. In general, 
the agreement between the reports from the cri- 
terion instruments was very high, with discrep- 
ancies of the order of 0.05 percent. When the 
reports differed by more than 0.5 percent the 
experimenter requested a recheck from each cri- 
terion unit operator. If the discrepancy con- 
sistently exceeded 0.5 percent after checking, the 
experiment was halted until the equipment 
could be examined and recalibrated. 

When the experimenter indicated that a pat- 
tern of points was ready, the monitors standing 
by the two test units started the subjects in their 
task. Each monitor specified the number sys- 
tem assigned to the scale markers for that partic- 
ular trial, and, using a grease pencil, wrote in on 
the glass cover of the display the numerical value 
of each marker (see Fig. 1). 

Eleven different numerical units were tested: 
1,000, 2,000, 3,000, 4,000, 5,000, 6,000, 7,000, 
8,000, 9,000, and 10,000 yards, and 2.5 miles. 
For example, in one trial the subject was told 
that the interval between any two scale markers 
represented 2,000 yards and that he was to make 
his estimate of the position of each point in terms 
of such a scale unit. In this case, the innermost 
marker in Fig. 1 would represent 2,000 yards, 
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the next 4,000 yards, the next 6,000 yards, and 
the outer edge 8,000 yards. Since the nautical 
mile is equivalent to 2,000 yards, a 2.5-mile scale 
unit was included to evaluate its psychological 
equivalence to the 5,000-yard scale. Accuracy 
of judgment was emphasized as thé primary ob- 
jective. No time limit was imposed for the 
task, 

Starting from an arbitrary reference radius of 
the circular display, the subject took each point 
in turn, estimating the position of the point rela- 
tive to the center of the field and to the scale 
markers between which it fell. His estimate was 
reported orally to the monitor who stood beside 
him as he worked. When both subjects (one on 
each of the test units) finished with a group of 
five points, their monitors called the experi- 
menter to indicate readiness for the next pattern. 
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Five such cycles (of five points each) constituted 
acomplete trial. The average time per complete 
trial was about 35 min., with some variation 
from 25 to 45 min. 


Experimental Design 


The experimental trials were arranged in the 
form of an 11 X 11 two-factor design with one 
replication, so that each subject was tested twice 
with each of the 11 scales. The sequence of 
trials was arranged so that the scales were used 
in a different order for each subject in each series. 
Each series of trials (that is, the original and 
replication) was run on a separate test unit and 
the subjects alternated between test units (and 
thus between the trials of each series) in a ran- 
dom order. 


TABLE I 


Tue Exrertmentat Desicn Usep in Tuts Stupy 


The entries are the trial numbers assigned to each condition. 


For example, on the first trial sub- 


ject J was tested on instrument 1 with the 4,000-yard scale; on the second trial, subject B was tested 
on instrument 1 with the 8,000-yard scale and subject H was tested simultaneously on instrument 2 


with the 2,000-yard scale. 








Subject | Instru- 





ment 
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For various reasons, such as breakdown of 
equipment, and loss of a subject in the middle of 
the experiment, we could not adhere rigidly to 
the experimental design and randomized se- 
quences made up before the experiment began. 
Table I shows the design we actually used. It 
is, we feel, completely adequate for our purposes. 

The experiment extended over 24 working 
days with 3 to 12 trials run per day. On 92 
trials, two subjects: were run simultaneously on 
the two test units as originally planned; on 58 
trials only one subject was tested. 

Patterns of localization points were selected 
randomly within prescribed areas so that the 
entire surface of the visual display was equally 
well sampled for each subject and for each scale. 


SuBJECTS 


The subjects were 11 experienced naval radar 
operators trained in the use of polar coordinate 
plots and familiar with the process of visual inter- 
polation in circular scales. 


RESULTS 


The true position of each point was 
defined as the mean of each pair of 
determinations made with the two cri- 
terion units for each localization 
point. After appropriate conversion 
(in terms of the correct numerical 
unit for each trial) the mean criterion 
position for each point was subtracted 
from the subject’s estimate. The de- 
viation thus obtained was expressed as 
a percentage of the interval used in 
that trial. For example, let us sup- 
pose that a subject, on a trial with the 
2,000-yard scale, reported the position 
of a point as 5,000 yards. If the true 
position of the point was 4,960 yards, 
the deviation is +40 yards. The rel- 
ative error score is +40/2000 or 
+2.00 percent. Twenty-five such 
scores were obtained in each trial.‘ 


Constant Errors as a Function of 
Distance from the Center 
of the Display 
When the entire batch of data was 
in, errors were first plotted as a func- 


4 In a few instances, the N for a trial is 24, due 
to recording errors. 
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tion of the distance of the localization 
point from the center of the display. 
The results, shown by the solid line 
in Fig. 3, reveal two marked trends. 
First, there are large positive errors 
near the center or innermost ring of 
the display and these gradually be- 
come less positive for points near the 
outer edge of the display. Second, 
there is a marked cyclical variation in 
the errors between each pair of scale 
marks, 

Without elaborating, we shall state 
categorically that the first trend can 
be traced to calibration errors between 
the test and criterion units. Daily 
engineering protocols maintained by 
the electronic technicians show these 
errors consistently. Unfortunately, 
we, as psychologists, understood so 
little of the electronics of the radars 
that we were not able to anticipate 
and correct for the constant errors in 
the equipment calibration procedures. 
The straight line drawn through the 
data in Fig. 3 was fitted by least 
squares to the entire group of data 
and was used to correct for the con- 
stant errors. Constant errors, with 
the calibration errors removed, are 
shown by the interrupted line in Fig. 
3. In making this correction, we are 
not exactly sure what the zero calibra- 
tion error should be. The correction 
for overall trend is valid, but the point 
of true zero error is open to some ques- 
tion. In statistical terms, we are sure 
of the correction for slope but are not 
sure of the exact correction for the 
intercept constant. We have arbi- 
trarily equated the mean error for the 
11 different numerical scales to the 
interceptconstant. It might be, how- 
ever, that all or most of the numerical 
scales would show positive constant 
errors—if we could measure them ex- 
actly. If this were the case, the inter- 
cept constant we selected was too low. 
In the converse case, all or most of the 








A. CHAPANIS AND M. LEYZOREK 








sen-- fn 
23522328288 








AVERAGE PERCENT ERROR 


-2.5 














INNERMOST 
RING 


SECOND 
RING 


THIRD 
RING 


OUTER 
RING 


Fic. 3. The solid line shows constant errors of estimation as a function of the radial distance of 
points from the innermost scale marker to the edge of the display. ‘The N at each point is shown at 
the top of the chart. The straight line drawn through these data is fitted by least squares and has the 
equation M, = —o.165 R + 4.599, where R varies from 1 (on the left) to 30 (on the right). This 


straight line shows the constant errors of calibration between the criterion and test units. 


When 


these constant errors are eliminated, the constant errors of estimation are shown by the dotted line. 


numerical scales might have negative 
constant errors, in which case our cor- 
rection for the intercept constant was 
too large. This is not as serious a 
difficulty as might appear at first 
glance, and the interested reader is 
referred to Chapter 3 of Chapanis 
et al. (3) for a more thorough discus- 
sion of the measurement of errors in 
systems such as these. In any case, 
the intercept selected in this experi- 
ment is in error by no more than about 
one percent. 

The cyclical nature of the error 
data evident in Fig. 3 is shown more 
clearly in Fig. 4 which combines all 
the data into tenths of the interval 
between any two scale marks. This 
figure shows that points between an 
inner reference ring and the midpoint 
of the interval tend to be overesti- 
mated; points between the midpoint 
and the outer reference ring tend to be 
underestimated; points near reference 


rings, or near the midpoint of the 
interval, tend to have small constant 
errors. Ina study of a paper display 
something like the one we have used, 
Bartlett ¢t al. (1) found no clearcut 
variations in constant errors such as 
we report here. This may be due to 
the fact that Bartlett et al. have dis- 
continuous data (subjects were re- 
quired to report their estimates in 
units which were 1/20 of the interval 
between reference rings), whereas our 
data are continuous (subjects were re- 
quired to report their estimates as 
precisely as possible). With the dis- 
continuous scale used by Bartlett, the 
only possible errors were those of 0, 5, 
10, 15, etc., percent. In our data, 
errors could assume any magnitude. 
Note too that our constant errors, 
though consistent, are small in amount 
—they do not exceed two percent. It 
is possible, therefore, that this tend- 
ency did not show up in the Bartlett 
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Fic. 4. The data in Fig. 3 (interrupted line) 
are here combined for the three scale intervals. 
The numbers in parentheses are the numbers of 
observations contributing to each point. This 
figure shows a consistent pattern of over- and 
underestimation according to the location of the 
point. Note that points near the midpoint of 
the interval are estimated with virtually no 
constant error. 


study because of the coarseness of the 
grouping forced on the judgments of 
the subjects. 


Variable Errors of Interpolation as a 
Function of Location Between 
Reference Markers 


There is also a marked variation in 
the variable errors as a function of the 
position of the point between refer- 
ence rings. In Fig. 5, we have plotted 
the standard deviations of the relative 
errors of estimation for all numerical 
scales as a function of the position of 
the points between reference rings. 
Subjects showed much less variability 
in their estimates for points near refer- 
ence rings. They also showed less 
variability in estimating points near 
the middle of the interval. Variable 
errors for points in the outer half of 
the interval are greater than for those 
in the inner half. Bartlett et al. (1) 
showed similar data for variable errors 
in their experiment and the agreement 
between their curves and ours is good. 
They found, as we did, that variable 
errors are smallest near reference rings 
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and in the center. They also found 
variable errors in the outer half to 
be greater than those in the inner half 
and, finally, their standard deviations 
(when multiplied by five to convert 
from their units of measurement into 
our ‘percent errors’) are of about the 
same magnitude as ours. In sum- 
mary, then, the data in Figs. 4 and 5 
both show that the midpoint of the 
interval has some sort of psychological 
stability and reference value. 


Constant Errors for the 11 
Number Scales 


The primary purpose of this study 
was to investigate the effect of differ- 
ent number scales on accuracy of 
interpolation. Accuracy here has 
been analyzed into two parts: con- 
stant errors and variable errors. 
Table II shows the mean constant 
errors for the 25 observations made 
by each subject under each condition 
of experimentation. The analysis of 
variance for these data is shown in 
Table III. 

It is clear that there are marked 
differences between the constant errors 
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Fic. 5. Standard deviations of relative errors 
of estimation as a function of the radial distance 
of points between scale markers. The numbers 
in parentheses are the numbers of observations 
contributing to each point. The precision of 
estimates is high near scale markers and near 
the mid-point between markers; at other places 


it is not as good. 
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TABLE II 


Mean Re ative Errors or Estimation ror Eacu Susyect anp Eacu Conpition 
The entry in each cell is based on 25 estimates. 
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Mean (Instrument 1) = —0.11; Mean (Instrument 2) = —o.04. 


for the different numerical scales, and 
Table III shows that these differences 
are significant. To find which of the 
constant errors differ significantly 
from each other, we computed ?’s 
for each pair of means with the re- 
sults shown in Table IV. Scales with 
nearly identical constant errors—as 
for example the 4,000 and 9,000 yard 
scales—are grouped together. The 
standard error of the difference was 
computed by taking the figure for the 
population variance estimated from 
the interaction (Table III) and ¢-ta- 
bles were entered with 1 and 100 de- 
grees of freedom. This, of course, 


It is immediately apparent from Table 
IV that there is only one number 
scale—-the 2.5 scale—which is really 
deviant. For some unexplained rea- 
son, this scale gave a negative con- 
stant error which was significantly 
different from the constant error for 
every other scale—except possibly the 
3,000 scale. Constant errors for all 
scales other than the 2.5 scale do not 
differ significantly. It is our opinion, 
incidentally, that the significant inter- 
action in Table IV can also be traced 
to the 2.5-scale data. Although this 
scale gave the largest average negative 





yields a more sensitive test of signifi- 
cance than would be achieved with 
the standard error computed from the 
22 items contributing to each mean. 


constant error, note in Table II that 
subject H had his second-highest 
positive constant error with the same 
scale. 
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TABLE III 


ANALYSIS OF VARIANCE OF THE 
Data n Taste Il 





Sums of 


Source of Variance Squares 





Between Subject 
Means 


Between Scale 
Means 


70.70 


221.66 


Interaction 631.17 


Between Instru- 
ments Within 


Cells 436.95 
Total 





1,360.48 














* Significant at the 1% level 
** Significant at the 0.1% level when tested 
against the interaction 


Variable Errors for the 11 Scales 


Variable errors for the same data 
are shown in Table V and the analysis 
of variance of these data is shown in 
Table VI. As in the case of the con- 
stant errors there are some marked 
differences among the various scales 
and these differences are highly sig- 
nificant. Comparisons between pairs 
of means for the various scales are 
shown in Table VII which gives t-val- 
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ues and levels of significance for all 
worthwhile comparisons. As in Ta- 
ble IV, scales with nearly identical 
means are grouped together. 

Table VII shows that differences be- 
tween the mean errors for the 10,000, 
1,000, 2,000, 5,000, 4,000, and 8,000 
scales are not significantly different. 
The 9,000 scale, however, is signifi- 
cantly worse than the 10,000 scale at 
the one percent level, and worse than 
the 1,000 scale at the two percent level 
of confidence. The null hypothesis is 
acceptable for the differences between 
the 9,000 scale and the 2,000, 5,000, 
4,000, and 8,000 scales. 

The 3,000 and 6,000 scales are sig- 
nificantly worse than the 10,000 and 
1,000 scales at the one percent level of 
confidence. They are not signifi- 


cantly different from the 2,000, 5,000, 
4,000, 8,000, and 9,000 scales. 

The 7,000 scale is worse than the 
10,000 and 1,000 scales at the one per- 


cent level; it is worse than the 2,000, 
5,000, 4,000, and 8,000 scales at be- 
tween the § and 10 percent levels; and 
is not significantly worse than the 
9,000, 3,000, and 6,000 scales. 

The 2.5 scale is significantly worse 
than every other scale. in some 


TABLE IV 


Tue t Ratios ror THe Sicniricance or Dirrerences Berween Mean 
Constant Errors Suown 1n Taste Il 


Entries in parentheses are the levels of significance attached to each t. 


constant errors are grouped together. 


Scales with nearly identical 


For every comparison the scale at the top of the table has a 


more positive constant error than the one with which it is compared along the side of the table. 








4,000 


9,000 * 8,000 


6,000 


5,000 


1,000 7,000 


3,000 











1.68(10%) 
1.82(10%) 
2.18(5%) 


1.66(10%) 


2.02(5% 1.84(10% 
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6.31 . 6.31] 
S.E.piat = eS + $3t] = 0.757. 
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cases, the significance is better than 
the 0.1 percent level. This is true, 
for example, for the comparison of the 
2.5-mile scale with the 5,000-yard 
scale, which is especially interesting 
because both kinds of scales have been 
used on modern radars. As was 
pointed out earlier, the 2.5-mile scale 
is equivalent to the 5,000-yard scale 
because a nautical mile is equal to 
2,000 yards. Despite this numerical 


equivalence, our data show that vari- 
able errors of interpolation are 1.6 
times greater for the 2.5-mile scale as 
compared with the 5,000-yard scale. 
This is a striking illustration of the 
psychological non-equivalence of two 
numerically-equivalent scales. It is 
worth noting in passing that the vari- 
able errors with the 2.5-mile scale are 
nearly twice as large as those for the 
best scale—the 10,000-yard scale. 


TABLE V 


Stanvarp Deviations or RELative 


Errors or Estimation ror Eacu 


Susjyect ano Eacu Conpition 


The entry in each cell is based on 25 estimates. The standard deviations are 
computed around the mean error for each cell as given in Table II. 








Number Scale 





5.000 


7,000 





4-95 
3.48 





2.43 
2.86 


3.07 
5-29 


2.96 
7-84 





6.54 
6.57 








6.45 
5.67 








2.85 | 3.82] 3. 3-93 
3-04 | 4-76 | 3. 10.42 


5-34 
4.30 
4.12 
2.57 
4-49 
4:57 


2.64 


2.27 





3-89 
3.30 


3.66 
4-37 
5.18 
7-09 


3-93 
5.66 


12.84 
12.53 


5.63 
7-39 
2.81 
5.08 


3.68 
5.86 


3-86 
5-24 
8.12 
4:74 


4.20 
8.89 

















4-63 








5.82 























Mean (Instrument 1) = 5.21; Mean (Instrument 2) = 5.08. 
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TABLE VI 


ANALYsIs OF VARIANCE OF THE 
Data tn Taste V 








Sums of 


Source of Variance Squares 





Between Subject 
Means 


Between Scale 
Means 


Subject-Scale 
Interaction 


268.15 


208.19) 


463.05 

Between Instru- 
ments Within 
Cells 


Total 


491.29 





1,430.68 














* Significant at the 0.1% level 


Analyses with Other Methods 
of Measuring Errors 


To check on the stability of these 
findings, we recalculated the analyses 
presented in Tables V and VI using 
different methods of computing errors. 
For example, we calculated standard 
deviations of relative errors of estima- 
tion around the mean constant errors 
shown along the bottom of Table II 
instead of around the mean constant 
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errors for each cell of Table Il. (The 
latter method, of course, is the one 
we used to get the values in Table V.) 
Then we also calculated average er- 
rors of estimation (average deviations) 
for the data when they are uncorrected 
for the constant errors of calibration 
shown in Fig. 3. The results of these 
alternative analyses are gratifyingly 
consistent. There are some slight 
shifts in the rank-orders of the vari- 
ous scales, but the shifts are never 
more than two rank-order positions. 
No matter how errors are defined, the 
1,000, 10,000, 2,000, and 5,000 scales 
are always the best four; the 3,000, 
6,000,"7,000, and 2.5 scales are always 
the worst four; and the 4,000, 8,000, 
and 9,000 scales always rank in the 
middle. 


Discussion 


The primary objective of this re- 
search was to obtain information on 
the accuracy of visual interpolation 
when various number systems are as- 
signed to scale markers. Significant 
differences have been demonstrated 
between numerical scales in this re- 
gard, but for purposes of practical 
scale design the information obtained 


TABLE VII 


Tue ¢t Ratios ror THe Sicniricance or Dirrerences Berween Mean 
Stanparp Deviations SHown tn Taste V 


Entries in parentheses are the levels of significance attached to each ¢. 


mean standard deviations are grouped together. 


Scales with nearly identical 


For every comparison the scale at the top of the 


table is better than the one with which it is compared along the side of the table. 
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from this experiment must be com- 
bined with the data of other, related 
studies. We have shown that nu- 
merical scale units of 1,000 and 10,000 
yards give maximum accuracy of vis- 
ual interpolation. From this we can- 
not generalize that all scales should 
have numerical units of 1,000 or 
10,000 since both the physical size 
and the total numerical range of the 
scale also have to be considered. 
Leyzorek (4), for example, has shown 
that best relative accuracy of inter- 
polation occurs in scale intervals } in. 
in size. Thus for some sizes of display 
it might be better to use a 2,000 or 
5,000 scale. 

Still to be explored is the possibility 
that number system and physical 
scale size may interact in such a way 
as to further limit the generality of 
these findings. The fact that this 
experiment studied only a particular 
series of numbers with a specific unit 
of length (yard) also restricts the de- 
gree of generalization possible. These 
limitations grew out of the nature of 
the equipment used. 

A number of interesting questions 
are raised by the line of investigation 
pursued in this experiment. Super- 
ficially, at least, the visual interpola- 
tion problem remained the same for 
the subject regardless of the numeri- 
cal scale with which he worked. But 
the difference in performance, for ex- 
ample, between the numerically- 
equivalent 5,000-yard and 2.5-mile 
scales shows that the psychological 
problem was not the same, at least 
for some subjects, with these two 
scales. The question arises: In what 
way did the numerical scale influence 
the judgments made by the subjects? 


The process of visual interpolation 
probably involves setting up a series of 
‘anchor points’ in the interval between 


scale markers. For example, the subject 
may visually divide the scale interval 
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into halves, quarters, or thirds, and may 
then subdivide these fractions further. 
It is largely in terms of such fractional 
units or anchor points that the subject 
makes his estimate of the position of a 
point, later converting the fractional part 
of the interval corresponding to the 
point’s position into its numerical equiv- 
alent in the scale being used. We then 
ask: (a) Does the subject use one frac- 
tionation technique all the time and con- 
vert his fractions into the scale in use at 
the time, or does he use a different set of 
anchor points, and fractions for each 
numerical scale? (b) Is there a true 
semantic or mathematical difference be- 
tween the numerical scales which renders 
fractionation or conversion of fractions 
more difficult in one number system than 
in another? (c) Is there some optimal 
fractional unit or combination of units 
which will give maximum accuracy in 
visual interpolation? For the answers 
to these questions, new experiments must 
presumably be devised. 

Why any one numerical scale was 
superior to the others is a question which 
remains unanswered by this experiment, 
but some logical speculation on this sub- 
ject is defensible. The superiority of the 
1,000- and 10,000-yard scales seems 
clearly based on our almost universal 
acceptance and use of a decimal system 
of numbers. Vernon (5) has shown that 
whole numbers in the decimal system, 
such as 1,000, 100, 10, gave the fewest 
errors while scale intervals in steps of four 
gave the most errors. Scale divisions in 
steps of two were almost as good as those 
in the decimal units. In general, chang- 
ing the scale number by a factor of 10 
did not significantly alter its efficiency. 
Another finding of Vernon’s study was 
that decimal fractions such as .o1, .1, .2, 
were much harder to use than were whole 
numbers like 1, 10, 2, 20. This is con- 
sistent with our finding that the 2.5-mile 
scale gave the poorest results. 

Yet to be determined is the basis for 
the superiority of the decimal scale unit. 
Certainly, familiarity is one important 
factor, but the mental process involved 
in the fractionation of numerical scale 
units needs to be studied for the final 
answer. 
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SumMMARY 


This experiment was designed to get 
information on the relationship be- 
tween accuracy of visual interpolation 
and the numerical system assigned to 
scale markers. The visual problem 
presented to the subject contained a 
number of short arcs (called localiza- 
tion points) scattered in and concen- 
tric with a pattern of four equally- 
spaced, concentric circles which served 
as scale markers. The subject’s task 
was to estimate the position of each 
localization point in the field relative 
to the center of the display and to the 
scale markers between which it fell. 

Eleven different numerical scale 
units were tested: 1,000, 2,000, 3,000, 
4,000, 5,000, 6,000, 7,000, 8,000, 9,000, 
10,000 yards, and 2.5 miles. For each 


set of estimates comprising one trial 
the subjects used the number assigned 
for that trial, i.e., in one trial the sub- 
ject made his estimates in terms of a 


2,000-yard scale interval unit while 
in another trial he used a 4,000-yard 
unit, and so on. The experimental 
trials were arranged in an 11 XII 
factorial design with one replication. 

Relative errors of estimation were 
analyzed into two components, con- 
stant errors and variable errors, the 
former being expressed by the mean, 
the latter by standard deviations 
around the mean. Differences be- 
tween the constant errors and variable 
errors for the different scales were 
found to be highly significant. In the 
case of the constant errors, the sig- 
nificance was due entirely to one devi- 
ant scale—the 2.5 scale—which had a 
large negative constant error as com- 
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pared with the other scales. Con- 
stant errors for the other 10 scales did 
not differ significantly. 

There were several highly significant 
differences among the scales in terms 
of error variability. Ranked in order 
from best to worst (smallest to largest 
variable errors) the scales are: 10,000, 
1,000, 2,000-5,000 (equal), 8,000, 
4,000, 9,000, 6,000, 3,000, 7,000, and 
2.5. The worst scale (2.5 miles) gave 
errors about twice as large as those 
obtained with the best scale (10,000 
yards). 


(Manuscript received for immediate 
publication July 10, 1950) 
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THE VISUAL DISCRIMINATION OF GEOMETRIC FORMS * 


BY ROLAND CARL CASPERSON 
The Johns Hopkins University ** 


INTRODUCTION 


Form discrimination as it is used 
here means the identification of a par- 
ticular geometric form as being differ- 
ent from all other possible ones. 
Practically, the total number of forms 
in any experiment is usually small so 
that discrimination involves a choice 
of one among the several forms pre- 
sented. The discriminability of a 
form is usually measured by frequency 
or percent of correct reports as a func- 
tion of some variable. Those vari- 
ables most often investigated in the 
literature have been size, intensity of 
illumination and position in the ret- 
inal field. Regardless of the vari- 
able chosen, the relative discrimin- 
ability of a particular form is essen- 
tially unaffected. 


Theoretical Background 


Empiricists predict that a familiar 
form will be easier to perceive or dis- 
criminate. Although it has been demon- 
strated (1, 5,6, 7,9, 12) that experience 


is related to the discrimination of form, * 


it is practically impossible to apply the 
Empirical theory to geometric forms. 
Most geometric forms are already so 
familiar to the average O that no amount 
of experimental practice would produce 
the necessary differential in experience. 
About the only other systematic treat- 
ment of form perception is that of the 


* This research has been supported in part 
under the terms of a contract between Special 
Devices Center, Office of Naval Research, and 
The Johns Hopkins University, Contract N5- 
ori-166, Task Order I. This is report No. 166-I- 
108, Project Designation No. NR 784-001, under 
that contract. 

** The author is indebted to Dr. W. R. Gar- 
ner, who aided in the formulation of the problem 
and saw the research to its completion. 


Gestalt school. Although the Gestalt 
theory of ‘simplicity’ or ‘good figure’ 
can be applied to geometric figures, most 
investigations have disproved the con- 
figurationist hypotheses. For example, 
Helson and Fehrer (8) demonstrated the 
inadequacy of the Gestalt concept of 
‘simplicity.’ According to this concept 
the circle is the simplest figure and for 
this reason should be identified more 
easily than other forms. Actually, Hel- 
son and Fehrer found that the circle 
ranked after the rectangle and triangle in 
perceptibility. Similar results have been 
found in other studies (4, 11, 13, 14). 
Although the order of discriminability 
for different forms varied in these studies, 
they all agree that the circle is neither 
good nor exceptionally poor in discrim- 
inability. 

A second point which Helson and 
Fehrer discuss is the Gestalt idea that 
form is primary in experience. Accord- 
ing to the Gestalt theory all experience 
is formed and therefore form is primary 
in perception. Several studies in the 
literature have found this to be untrue 
(2, 3, 5, 8, 16). Helson and Fehrer 
found three levels of perception in their 
experiment which they also considered 
to be negative evidence for the primacy 
of form. Their Os first reported vague 
formless light, then a formed object 
which could not be correctly identified, 
and finally the correct form. In an at- 
tempt to defend the Gestalt hypothesis 
Wilcox (15) interpreted these findings as 
showing that the initial perception was 
formed. Although the light could not 
be correctly identified as a shape, it ap- 
peared as a rounded blur, which fit Wil- 
cox’s criterion for form. Such a defini- 
tion of form leaves little in our experi- 
ence that is unformed. It also makes 
any discussion of form perception or dis- 
crimination meaningless. If we use the 
traditional interpretation of discrimina- 
tion as the distinction and recognition of 
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a particular form as different from others, 
we can dismiss Wilcox’s definition of form 
perception. If a stimulus need only 
appear as a vague round blur of light to 
qualify as a form, there is no way of test- 
ing the primacy of form in experience. 

It is also possible to argue that when 
form is indistinguishable stimuli are re- 
duced to a roughly circular. blur. This, 
then, is not form but a complete lack of it. 
Viewed in this way, the circle, instead of 
being the ideal form as configuration 
theory maintains, approaches formless- 
ness because it most closely resembles 
that condition wherein form cannot be 
discriminated. 

Such a conclusion has interesting im- 
plications. If we expect stimuli below 
the recognition threshold to appear round 
we might expect the frequency of reports 
for circles, ellipses and other similar 
figures to be high in any task in which 
form thresholds are measured. Under 
these circumstances we should also ex- 
pect on the basis of chance alone that 
more circles would be called correctly, 
and thus falsely have lower thresholds 
than other forms. 

A recent study by Hochberg, Gleitman 
and MacBride (10) offers one of the few 
bits of evidence supporting the Gestalt 
theory of form perception. Using three 
figures, a circle, rectangle and block cross 
of equal area, these investigators ob- 
tained intensity thresholds for form dis- 
crimination. Testing the Gestalt con- 
cept of ‘simplicity,’ which they defined 
as the ratio of perimeter to area, their 
results confirmed the hypothesis. The 
circle had the lowest threshold, while the 
square and the cross had progressively 
higher ones. 

There are two possible reasons for such 
excellent results. The first is that these 
investigators used only three figures. 
Such a small number of forms can hardly 
be considered an adequate test of so 
broad an hypothesis. Second, these 
authors present no information about 
the frequency of report for their figures 
regardless of accuracy. As was men- 
tioned above, a high incidence of circle 
reports, whether due to preference or the 
above mentioned conditions, would make 
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the circle appear to be superior. Al- 
though we cannot reject configuration as 
a possible determinant in discrimination, 
some quantifiable aspect of form should 
lead us to a little more understanding of 
the problem. Hochberg, Gieitman and 
MacBride have made a step in the right 
direction, but their results fall short of a 
conclusive determination of the import- 
ant factors. 


Purpose 


The specific purpose of this in- 
vestigation was to determine the dis- 
crimination thresholds of six different 
geometric forms and to relate their 
relative discriminability to three quan- 
tifiable aspects of their construction. 
To produce variations within the 
basic forms,' five different figures for 
each form were constructed. These 
figures, when equal in area, differed in 
maximum dimension and perimeter. 
These variations were necessary to 
demonstrate the effect of perimeter 
and maximum dimension without con- 
tamination by the property of the 
basic configuration of the six forms. 
Finally, each set of figures was repro- 
duced at seven different areas in 
order to measure discrimination 


thresholds. 


APPARATUS AND OBSERVERS 


There were 30 different stimulus figures used, 
consisting of the six basic forms: ellipse, rectan- 
gle, triangle, diamond, cross and star. Each of 
these forms was constructed in five different ways 
as shown in Fig. 1. Although all of the figures 
were made to have the same seven areas it was 
impossible to obtain this equality for the linear 
dimensions because of the nature of the geometric 
relations involved. They were constructed, 
however, so that the maximum dimensions and 
perimeters for the forms agreed as closely as 
possible. 

Six complete sets of stimulus figures were 
made for each of seven areas. The legend of 


1 In this paper, the word ‘form’ will be used to 
refer to a general class of regular geometrical 
shapes, for example, star, rectangle, circle or tri- 
angle; ‘figure’ will refer to variations in con- 
struction of any form, 
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Fig. 1 shows the seven areas used and the 
maximum dimension and perimeter resulting 
for the smallest set of figures. The linear di- 
mensions for the other sets of figures vary as the 
square root of the area and can be obtained for 
each area by using the multiplying factor indi- 
cated. The figures were solid black photo- 
graphic prints on semi-gloss paper; maximum 
contrast was obtained. ‘They were reduced from 
a master set of figures which were constructed 
eight times the size of the largest stimulus area 
used. In this way any minor error in construc- 
tion of the original figures was eliminated in re- 
duction. The stimuli were mounted on heavy 
cards measuring six and one-half in. square. 
Each card contained nine figures in random order 
arranged in three rows and columns. The 
figures were placed two in. apart from center to 
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center, regardless of area. For each area a set 
of 20 cards was necessary to contain all of the 
figures. 

The stimulus cards were exposed from the far 
end of an illuminated exposure box measuring 36 
in. square. The box was painted flat white on 
the interior and was lighted by four G. E. Lumi- 
line light bulbs around the four sides at the for- 
ward end of the box. The lights were shielded 
so that no direct light fell on the stimulus cards 
oron 0. The illuminance measured on the sur- 
face of the stimulus cards was 11.2 foot candles. 

O sat 20 feet from the exposure aperture and 
looked through the open front end of the illu- 
minated box. He was seated comfortably in an 
adjustable chair and was provided with an arm 
and chin rest. Although O used binocular vision 
and was allowed to move his head at will, the 


FORMS 
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Fic. 1. The experimental forms and figures. Maximum dimension (MD) and perimeter (P) in 


ems. for the smallest area used are indicated below each figure. 


These linear measures for the six 


larger areas can be obtained by multiplying the given values by 1.5, 2, 2.5, 3, 4 and 5. The seven 
areas used were .032, .072, .128, .200, .288, .512, and .800 cm? 
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arm and chin rest positioned his head so that he 
was directly in front of the exposure aperture 
at the correct height. 

The Os were 20 male students ranging from 
18 to 27 years of age. All Os had normal acuity 
as measured by the AO test chart; five of them 
wore corrective lenses. Especial care was taken 
to avoid astigmatics. 


PROCEDURE 


Each O had seven experimental sessions in 
each of which he judged a complete set of 30 
figures for a particular area 24 times. Each 
time a figure was repeated it appeared in a dif- 
ferent relation to the surrounding figures and in a 
different orientation. The sessions lasted from 
one to two hours depending on the size of the 
figures and the speed with which a subject could 
make his reports. The order of areas and cards 
within an area was counterbalanced throughout 
so that no two Os were presented the same 
order of figures and areas. 

A set of written instructions was given to each 
O at his first experimental session and the pro- 
cedure was discussed with him. He was shown 
a card containing all of the figures with their 
proper identifications. He was told only to re- 
port in terms of the six basic forms named. It 
was necessary to add this precaution because of 
the presence of a circle in the group of ellipses 
and a square in both the rectangle and diamond 
group. These distinctions were made quite 
readily by the Os and very few errors of confusion 
occurred. The Os were advised not to spend 
too much time on any one figure and the method 
of forced guesses was used. 

E sat behind the lighted box where he ex- 
posed the cards by means of a chute and lever 
which positioned the cards until released. O 
started with the upper left figure on a card and 
read them off from left to right as rapidly as he 
could without sacrificing accuracy, progressing 
down the card until he judged all nine figures. 
No knowledge of results was permitted and no 
learning was apparent other than familiarization 
with the procedure. 


RESULTS 


Because the two major problems of 
interest in this investigation involve 
somewhat different treatments of the 
data, the results are divided into two 
main sections, with an added discus- 
sion on individual differences. Al- 
though the results concerning the rela- 
tive discriminability of different forms 
might logically seem to take prece- 
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dence, the data pertaining to discrim- 
ination as a function of area, maxi- 
mum dimension and perimeter are 
presented first. These data contrib- 
ute to both questions and therefore 
reproduction of data is reduced by fol- 
lowing this order. 

Frequency of reports—Percent of 
correct reports, the measure of dis- 
criminability, has in all cases been 
corrected for response probability. 
Instead of correcting for the theoret- 
ical chance prediction based on the 
number of forms used, the corrections 
were based solely on the empirical 
frequency of reports for each of the 
forms. The correction formula is 
essentially the same as the one usually 
used in chance corrections, but the 
proportion used is determined by the 
percent of total error responses that 
are called a particular form. 

Table I shows the frequency of re- 
sponses and the percent of the total 
possible responses which were made 
for each form, regardless of accuracy. 
It can be seen from Table I that the 
ellipses exceeded all other forms in 
frequency of reports. The rectangle 
and triangle are also somewhat higher 
in frequency than would be predicted 
by chance, while the diamond, cross 
and star fall below chance expectancy. 
This confirms the suspicion raised by 
the earlier discussion in this paper, and 
emphasizes the need for a correction 
based on the frequency of reports. 


Discrimination as a Function of Area, 
Maximum Dimension and Perimeter 


The geometric relationships be- 
tween the variables considered in this 


* Correction for frequency of report was ob- 


tained by 


Nv 
(Re. = Tx - ; where: Ra = 


— Px 
corrected number of right responses for form X; 
T, = total number of form X presented; W, = 
number of form X wrong; P, = proportion of 
total errors made on all forms which were called 
form X. 
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TABLE I 


Frequency anp Percent or Reports ror Eacn Form Recarpiess or AccurAcY 
Theoretical chance percent based on the number of forms = 16.7 


| 
| 
| 





Percent 21.4 


17.1 


19-5 


experiment preclude any possible sys- 
tematic evaluation of the effects of 
one variable while the others are held 
constant. In setting up the experi- 
ment the seven different areas were 
selected to produce functions for all 
of the forms. The figures within a 
form on the other hand were con- 
structed to produce variations in 
maximum dimension and perimeter 
for these basic forms. It is obvious 
that these characteristics must vary 
together and in relation to area. 


For this reason it is necessary to ap- 
proach the problem of their influence 
on discrimination indirectly. 


Figs. 2, 3, and 4 show percent cor- 
rect reports as a function of area, 
maximum dimension and perimeter, 
respectively. Although these vari- 
ables are necessarily related in the 
construction of the various figures, it 
can be seen from these plots that the 
functions do not correlate perfectly. 
The shape of the curves for the 30 
forms and figures differs considerably 
depending on the variable considered. 
If any one of the measures investi- 
gated in this experiment was the sole 
determinant of a form’s discrimina- 
bility, all of the functions for that 
variable should fall on the same curve. 
The degree to which these functions 
differ for a particular variable indi- 
cates the amount to which some char- 
acteristic of form other than that 
variable contributes in discriminabil- 
ity of the various forms. 

In this experiment we are interested 
in finding that variable which pro- 
duces discrimination functions that 








Diamond Star Total 


14,919 12,806 100,800 
14.8 12.7 100 


| 14,591 
14.5 


are most alike for all of the figures. 
This variable then would reduce the 
variations among functions to a mini- 
mum, leaving the least amount of 
variance between curves to be ac- 
counted for by some other property 
of form. 

Inspection of Figs. 2, 3, and 4 shows 
that the functions differ considerably 
in the degree to which they approxi- 
mate a single curve. It can also be 
seen from these data that the func- 
tions differ for the linear measures, 
i.e., maximum dimension and perim- 
eter, in their points of origin and in 
their extent. This makes a quantita- 
tive comparison of the variance re- 
maining between the functions for the 
three variables impossible. We must, 
therefore, obtain some arbitrary value 
for each variable at which we can 
compare the functions for all of the 
forms and figures. The value having 
the most reliability and having a 
direct relationship to all of the func- 
tions is the 50 percent threshold value 
obtained by combining the data from 
all of the functions into a total mean 
curve’ (heavy solid line in upper left 
diagram of Figs. 2, 3, and 4). The 
threshold value for each variable on 
this total mean curve (shown as 
vertical dotted line in each set of 
functions) is the point about which the 


* Mean curves were computed for a range in- 
cluding the 25th and 75th percentiles in all cases, 
based on five arbitrarily chosen values on the 
abscissa for the linear variables. This was done 
to account for the differences in the empirical 
values of each variable used in plotting the 
functions. 
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variation between forms and figures 
is computed. The variance of the 
individual functions about the total 
threshold then is a measure of the 
inadequacy of a variable in predicting 
the discriminability of these figures. 
Therefore that variable which pro- 
duces the least variation among func- 
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tions contributes most to the dis- 
crimination of form. 

The total variance for the 30 func- 
tions can be partialed into two com- 
ponents, variance between forms and 
variance within forms. Between- 
form variance is computed by obtain- 
ing the deviations in percent of correct 
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Fic. 2. The relation between area and percent correct reports (each point represents 480 judg- 
ments). Light solid lines represent the functions for the figures within a particular form. The heavy 
dashed line represents the mean curve for each form. The vertical dotted line in each graph repre- 
sents the threshold value for the total mean curve for all forms (drawn as heavy solid line in upper 
left graph). 
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Fic. 3. The relation between maximum dimension and percent correct reports (each point repre- 


sents 480 judgments). 
form, 


Light solid lines represent the functions for the figures within a particular 
The heavy dashed line represents the mean curve for each form. The vertical dotted line in 


each graph represents the threshold value for the total mean curve for all forms (drawn as heavy solid 


line in upper left graph). 


reports for the mean curves for each 
of the six forms (heavy dashed lines 
in diagrams) around the total mean 
threshold. Within-form variance is 
obtained from the deviations in per- 
cent of correct reports of the five indi- 
vidual figure curves (light solid lines 
in illustrations) around their respec- 
tive mean form curves at the total 


mean threshold value for each vari- 
able. In this way the average within- 
form variance and the between-form 
variance add up to the total variation 
for the 30 functions around the total 
mean threshold. 

Table II shows the amount of vari- 
ance remaining for the functions of 
each of the three variables considered, 
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as measured by the foregoing method. 
It can be seen from Table II that no 
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maximum dimension is held equal. 
And both the crosses and stars differ 


one of the variables is successful in 
minimizing the within-form variance 
for all six of the basic forms. It ap- 
peais that area is most successful for 
the ellipses and triangles. The rec- 
tangles and diamonds differ least w hen 


minimally when perimeter is the con- 
trolling factor. When we consider 
the between-form variance it is ap- 
parent that maximum dimension does 
the best job in equating forms for 
discrimination. Finally the total vari- 
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Fic. 4. The relation between perimeter and percent correct reports (each point represents 480 
judgments). Light solid lines represent the functions for the figures within a particular form. The 
heavy dashed line represents the mean curve for each form. The vertical dotted line in each graph 
represents the threshold value for the total mean curve for all forms (drawn as heavy solid line in 
upper left graph). 
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TABLE Ul 


Variance 1n Percent Correct Reports Remaintnc wHen Forms anp FIGURES ARE 
Comparep ror TuresHotp Amounts or Eacu or THE THREE VARIABLES: 
Area, Maximum Dimension anp Perimerer 



































| Within-Form Variance Between 
, Total 
ans St : 7 Visamce | Variance 
Ellipse | Rec tangle Triangle Diamond) Cross Star | Mean 
oye —— | 
Area 68.5 | 668.7 13.6 136.6 | 1107.5 | 590.4 | 430.9 | 594 1024.9 
Max. Dim. 873.5 46.3 582.8 126.9 373-4 90.4 348.9 | 399.6 748.5 
Perimeter 76.3 107.9 95-5 243-3 246.9 61.2 253-5 | 436.8 690.3 
ance among forms and figures is at a_ criminability. For the crosses and 


minimum when perimeter is held con- 
stant. 

One further point is of interest in 
comparing the variances obtained 
among the forms and figures. An F- 
test indicates that the estimated popu- 
lation variance for between forms is sig- 
nificantly greater than that for within 
forms for all three variables (P < .o1 
in all cases). More directly observ- 
able and more important for our pur- 
poses, however, is the fact that the 
actual between-form variations ob- 
tained in the experiment are greater 
than the average within-form vari- 
ance. This would indicate that the 
differences between forms are real. 
In other words the discriminability 
differences produced by variations in 
the construction within a particular 
form are not as great as those obtained 
with different forms. This means 
we must acknowledge the importance 
of configuration in such a task as form 
discrimination, even when we are con- 
cerned primarily with the manipula- 
tion of quantifiable variables as a 
method. 

We can conclude from these results 
that for the ellipse and triangle forms 
used in this experiment area is the 
important factor in discrimination. 
Knowing the maximum dimension 
of rectangular and diamond shaped 
forms would appear to contribute 
most to our prediction of their dis- 


stars used best prediction of discrim- 
inability would be based on a knowl- 
edge of perimeter. If our task was to 
predict the relative discriminability of 
the average shape for the six basic 
forms used we would do best by com- 
paring their maximum dimensions. 
Finally, if we were to make all 30 
figures equally discriminable we would 
equate their perimeters. 

A word about the direction of the 
relationships involved should be men- 
tioned. In order for the functions to 
be meaningful none of the variables 
should be related negatively to dis- 
crimination. If this were the case we 
might expect variable effects which 
would tend to cancel one another. 
For instance we know that increasing 
the size, be it area or some other 
measure, increases discriminability. 
If we now assume, safely enough, that 
area is that measure we might obtain 
some inconsistent results with the 
methods used in this experiment. 
We know that each of the other vari- 
ables concerned must vary in some 
manner with area and, therefore, if 
one of these factors is negatively re- 
lated to discrimination its relationship 
would not show up in the plots used. 
In fact the effects of such a negative 
relationship might serve to reduce or 
remove the effects of any positive 
relationship present. 

Table III shows the percent of cor- 
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rect reports for each form and figure 
as a function of area. In Fig. 1 we 
saw that maximum dimension in- 
creases with the number assigned to 
the figure within a particular form. 
This is also the case for perimeter ex- 
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cept in the case of two triangles (4 and 
5) in which there is a reversal of the 
relationships. At any rate an in- 
spection of Table III reveals that all 
three variables are related positively 
to discrimination for the rectangles, 


TABLE Ill 


Percent or Correct Reports ror Att or THE Geometric Ficures as a Function 
or Area, Inctupinc Means ror THe Stx Basic Forms anv ror THe Seven Areas 
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crosses and stars. For the other 
three forms the relationship is indeter- 
minate for both maximum dimension 
and perimeter. In no case is there a 
negative relationship between any of 
the variables and percent correct 
reports. 

It would appear then that the 
knowledge we have about discrimina- 
tion can logically be extended to the 
direction of the relationship as well as 
the relative prediction value of each. 
Increasing that variable which is most 
successful in minimizing the variance 
between figures and forms as shown in 
Table II will be most successful in 
improving the discrimination of that 
particular figure or form. 


Relative Discriminability of Forms 


The relative discriminability of the 
six forms used in this experiment can 
be evaluated in several ways. In 
keeping with the methods most often 
used in the literature two comparisons 
have been made. The first compari- 
son actually provides us with three 
measures for these data. In a com- 
parison of the discrimination thresh- 
olds for the different forms it is neces- 
sary to consider each of the three 
variables measured. Since area, maxi- 
mum dimension and perimeter were 
varied we can obtain the threshold for 
each form in terms of each one of these 
variables. The second comparison is 
for the total percent of correct reports 
for each form regardless of the variable 
conditions. 

Table IV summarizes these com- 
parisons for the mean and best figure 
curves of each form for the 50 percent 
thresholds and for total percent cor- 
rect judgments. An inspection of 
Table IV reveals’ several things of 
interest. First, in none of the com- 
parisons does the ellipse rank as a 
‘good figure.’ In only one case (i.e., 
perimeter) does it rise above last place 
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in the rank of discriminability and 
then it attains only a rank of fifth 
out of the six possible forms. Al- 
though the forms shift slightly in their 
rank, depending on the measure used 
we can see that the triangle, cross and 
rectangle consistently maintain their 
ranks within the first three positions, 
while the star, diamond, and ellipse 
occupy the lower three positions, re- 
gardless of the measure. 

Comparison of best igures.—Because 
a circle, the best figure of the forms 
called ellipses, might suffer unduly by 
being grouped with other ellipses, a 
comparison was made of the best 
figures within each form. Table IV 
also shows a comparison of the best 
figure for each form for the four con- 
ditions. It is evident from these com- 
parisons that essentially the same re- 
sults areobtained. Although the best 
ellipse (circle) rises to a rank of three 
in the case of perimeter measures, it 
still remains in the lower half of the 
distribution for three of the four com- 
parisons, and retains its last place in 
two instances. These results are 
consistent with the results cited in the 
literature on size, area and visual 
field, except for the recent study by 
Hochberg, Gleitman and MacBride 
(10). When we consider the forms 
that they used, namely, the circle, 
square and block cross, we can only 
confirm their results. Comparing the 
ranks of these particular figures as 
obtained in this experiment the circle 
also appears to be easiest to identify. 
The square is next in discriminability, 
while the block cross fares worst. 

Although this experiment confirms 
the results of these authors, it reverses 
the conclusions they draw from their 
results, and in fact makes clear why 
their results fit so nicely into their 
theory. The circle, which is the best 
possible ellipse form, is compared with 
the square and block cross, which are 
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TABLE IV 


Revative Discamunasitity or Srx Geomeraic Forms Derermineo sy THE AMOUNT OF 
Area (A), Maximum Dimension (MD) anv Permerer (P) Necessary to Propuce 
so Percent Correct Reports, anp BY THE TotaL Percent Correct Reports 
REGARDLESS OF AREA 


Data are given for mean form and for best figure within a form classification. 


Threshold values 


for the mean and best figure forms can be obtained from interpolation on the curves presented in Figs. 
2,3,and4. Total percent correct reports for the respective forms can be obtained from Column eight 


in Table III. 
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Ellipse .338 
Rectangle 165 
Triangle 117 
Diamond .285 
Cross 105 
Star 255 
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both the worst figures for their re- 
spective forms. Evidence of this 


fact can be obtained by comparing 
the mean percent correct reports for 


each of these figures in Table III 
(these three figures are number 1 in 
their respective form categories). It 
can be seen that the circle is best, 
receiving 40.5 percent correct reports, 
while the square and block cross each 
received 35.9 and 30.2 percent, re- 
spectively. Instead of confirming the 
conclusions of these authors then, 
these results are more important as 
evidence for the inadequacy of their 
test of the ‘simplicity’ hypothesis. 
Further, the results of this experiment 
demonstrate the inadequacy of the 
Gestalt theory of ‘simplicity’ or ‘good 
figure’ in the discrimination of form. 
From these data, it would appear that 
increasing the maximum dimension 
and perimeter, or the complexity of a 
form, in most cases increases the 
probability of its being seen. 
Individual differences.—One point 
that is often neglected in the literature 
on form perception is the problem of 
individual differences. This problem 


is not only crucial to any principles or 
formulations concerned with form dis- 
crimination, but it is very important 
in the simple interpretation of results. 
When we say that one form can be 
identified more easily than another 
we imply that this is a stable char- 
acteristic of a particular form and 
holds for all individuals. Such an 
implication may not be justified. 
For this reason an analysis of the 
Variations in percent correct reports 
between individuals and between the 
six basic forms was made. If we 
maintain that the relative discrimin- 
ability of a form is a property of that 
form and not a characteristic of a 
particular individual at some given 
time, we would expect the form differ- 
ences to be significant in such an 
analysis. 

Table V summarizes the results of 
an analysis of the variance contrib- 
uted by the 20 individual Os in this 
experiment as compared with the vari- 
ance due to form differences for the 
six basic forms. F-tests show that 
the variance due to form and indivi- 
dual differences are both extremely 
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TABLE V 


ANALYSIS OF THE VARIANCE CONTRIBUTED 
BY Form anv Inpivipvat Dirrerences 
Number of forms = 6; number of 

individual Os = 20. 





Sums of Estimated 


Squares 


Source of 
Variance 
Between forms 
Between Os 
Remainder 


1,319.045.08 
793,170.81 
178,399.59 


Total 2,291,521.48 











significant (P <.01), although the 
significance is far greater for form 
differences. An inspection of column 
2 in Table V presents these findings 
and shows that the sum of squared 
deviations obtained between forms is 
much greater than the sum of squared 
deviations betweeen _ individuals. 
These data then verify the earlier con- 
clusion that there are real differences 
in the discriminability of the six forms 
used in this experiment. Further, it 
is clear that if there are any incon- 
sistencies between Os in these func- 
tions, these inconsistencies are very 
small compared to the over-all effect 
of form. 


SUMMARY AND CONCLUSIONS 


Discrimination data were obtained 
for 30 solid geometric figures consist- 
ing of five variations in the construc- 
tion of each of six basic forms: an 
ellipse, diamond, triangle, rectangle, 
cross and star. Percent of correct 
reports, corrected for response fre- 
quency, was plotted as a function of 
area, maximum dimension and per- 
imeter. The effectiveness of each 
variable in predicting discriminability 
for the different forms and figures was 
determined by comparing the vari- 
ance remaining between figures having 
equal threshold amounts of each of 
the three variables. That variable 
which produced the least variance 
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between functions was considered the 
best predictor of discriminability. 


Results 


1. Area was found to be the best 
measure of discriminability for el- 
lipses and triangles. 

2. Maximum dimension best pre- 
dicted discriminability for rectangles 
and diamonds. 

3. Perimeter was the best predictor 
for stars and crosses. 

4. As a group the six basic forms 
can be evaluated best for discrimin- 
ability when their maximum dimen- 
sions are known. 

5. Prediction of discriminability for 
the total of 30 forms and figures was 
best accomplished on the basis of 
their perimeters. 

6. Comparison of the relative dis- 
cri». nability of the six forms used in 
this experiment in terms of thresholds 
and total percent of correct reports 
confirms earlier findings that the circle 
and elliptical shapes in general are not 

"y easy to identify. 

7. These results indicate that the 
Gestalt principle of ‘simplicity’ is in- 
adequate as a predictor of the relative 
discriminability of ellipses, rectan- 
gles, triangles, diamonds, crosses, and 
stars such as those used in this experi- 
ment. 

8. A comparison of the variance 
contributed by the 20 Os with the 
variance due to form differences sub- 
stantiates the hypothesis that forms 
do differ in their discriminability and 
that differences among individuals 
making the discrimination are small 
compared to these form differences. 


(Manuscript received for immediate 
publication July 17, 1950) 
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FIGURE-GROUND REVERSAL AS A FUNCTION OF 
VISUAL SATIATION ! 


BY JULIAN E. HOCHBERG 
Cornell: University 


According to the theory proposed 
by Kohler and Wallach to explain the 
visual effects obtained after prolonged 
fixation of an ‘inspection’ pattern (2), 
the perception of a figure entails the 
flow of a ‘figural current’ through the 
cortical medium, and the raised ‘re- 
sistance’ (presumably some phenom- 
enon of electrotonus) of the medium 
resulting from the current flow is as- 
sumed to be responsible for the figural 
after-effects obtained. Kohler sug- 
gests (I, p. 72) that some such process 
of ‘satiation’ is also responsible for 
the reversal of ambiguous figures: 

If it is assumed that the cortical 
region isomorphically corresponding 
to the perceived figure is the seat of 
the highest concentration of cortical 
current flow (2, p. 328), then which- 
ever portion of an ambiguous pattern 
is first seen as figure should satiate its 
corresponding cortical region at a 
faster rate than do the other portions 
of the field. When the satiation- 
induced resistance opposing current 
flow in this region becomes sufficiently 
high a restructuring of the percept 
should result, with the alternate por- 
tion of the pattern then being per- 
ceived as figure until its satiation 
results in further reversal, etc., until, 
perhaps, the entire region becomes 
extremely satiated.? 


' This paper reports a modified repetition of a 
portion of a dissertation submitted to the De- 
partment of Psychology of the University of 
California at Berkeley in partial fulfillment of 
the requirements for the Ph.D. degree, sponsored 
by Professors Edward C. Tolman, Egon Bruns- 
wik, and Franklin M. Henry, 1949. 

2A rigorous description of the process of 
figure-ground reversal in terms of the model pro- 
posed by Kohler and Wallach would probably 


If satiation is at least one of the 
factors responsible for the reversal of 
ambiguous figures, it should be possi- 
ble to interfere with the process by 
imposing a differential satiation upon 
portions of the cortical region in which 
the figural currents of the ambiguous 
figure are to flow. Thus, if a figure, 
such as Fig. 1c, corresponding to one 
of the crosses of the reversible figure, 
Fig. 1a, is presented for prolonged in- 
spection, the cortical region occupied 
by its isomorphic figure current would 
attain a raised resistance; on sub- 
sequent fixation of the reversible 


Fig. 1a, the cross falling on a less sati- 
ated cortical region should be more 
easily perceived as figure. 


Kohler 
and Wallach hold (2, p. 331) that the 
figural currents isomorphic to the 
perception of a black figure on white 
background are identical with those of 
a white figure on black ground except 
that the direction of flow is opposite in 
the two cases; and that the satiation- 
induced resistance (2, p. 332) is non- 
directional (i.e., a region satiated by a 
current of one direction will equally 
oppose current flow in the same or 
in the opposite direction). Conse- 
quently, it should make no difference 
whether the satiating figure and the 
coincident portion of the reversible 
test figure are of the same ‘color’ (i.e., 
both black on a white ground, or both 


entail consideration of the redistribution of cur- 
rents in the ground as well as in the figure, and a 
knowledge of characteristics of current and 
medium as yet unknown; the hypothesis as pre- 
sented by Kohler, however, seems sufficiently 
clear for the purposes of the problem here con- 
sidered. 
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FIGURE-GROUND REVERSAL AND SATIATION 


Cc q 


Fic. 1. The inspection and test figures 


opposite ‘color’ (one being black on a 
white ground, the other white on 
black). 

This paper reports an attempt to 
determine (a) whether the figure- 
ground reversal of an ambiguous pat- 
tern will be affected by prior satiation 
of the ‘visual region’ upon which one 
of the two alternative figures is to 
appear, and (b) whether any such 
effects are the same when both the 
inspection figure and the test region 
are both black, when they are both 
white, and when one is black and the 
other white. 


PROCEDURE 


Each subject was seated before a reduction 
screen permitting only monocular vision with 
the non-dominant eye, his head in a headrest and 
his left and right hands resting upon telegraph 
keys. A reversible figure 6.5 in. in diameter was 
presented against a neutral gray background 
which filled the field, at a distance of 32 in., and 
the subject was asked to press the left key while 
the black portion of the pattern was figure and 
to press the right key while the white portion of 
the pattern was figure. The key-pressings were 
recorded on a polygraph. After 10 reversals 
had occurred, the reduction screen was so moved 
that only the dominant eye was uncovered, and 
an inspection figure of the same size was pre- 
sented, its background filling the field of vision. 
The subject fixated this figure until he reported 
certain criteria of satiation to have appeared, or 
until five min. had elapsed. After this, the 
dominant eye was covered, and the original 
reversible figure was exposed to the non-domi- 
nant eye, until 10 reversals had been reported by 
telegraph key. The subjects were divided into 
four groups, groups A(N = 15) and B (N = 20) 
having as their inspection figure the black cross, 
Fig. 1c, groups C (N = 20) and D (N = 17), 


the white cross, Fig. 1d. As reversible test pat- 
terns, groups A and C had Fig. ta, groups B and 
D, Fig. tb. 

The instructions to the subject were as fol- 
lows: “You will see a reversible pattern in which 
either the black cross is seen as figure or the white 
cross is seen as figure. Maintain a passive atti- 
tude and allow the figure to reverse itself as it 
will. Maintain a fixation point as close to the 
center of the figure as you can without letting 
the figure fall into wedges, that is, whichever 
cross is being seen should be seen as a complete 
cross. Press the left-hand telegraph key while 
you see the black cross and the right-hand tele- 
graph key while you see the white cross. You 
may practice for a few moments until you feel 
that you can follow the reversals with the tele- 
graph keys.” The instructions with respect to 
the inspection figure were as follows: “This time 
you are not to press telegraph keys; you are to 
fixate the center of this cross, and you are to 
maintain an active attitude of trying to perceive 
the cross as figure, as a whole cross on a gray 
background. If, despite your attempt to the 
contrary, the wedges of background between the 
arms of the cross start becoming figure so that 
the pattern appears to become a field with the 
cross cut out of it, or any similar phenomena 
occur, please let me know.” 


Resu_ts AND Discussion 


The relative amounts of time de- 
voted to perception-as-figure of the 
white and of the black crosses (the 
two alternative figures within the 
ambiguous pattern) was obtained be- 
fore and after satiation, to determine 
which of the two shows an increased 
perception-as-figure in the after-pe- 
riod as compared to the fore-period. 
(See Table I.) A significant propor- 
tion of the subjects in Group A (black 
$atiating figure spatially coincident 
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with black cross of test pattern) re- 
vealed a relatively decreased percep- 
tion of the black cross (or increased 
perception of the white cross) after 
satiation; a significant proportion of 
the subjects of Group D (white sati- 
ating figure coincident with white 
portion of the test pattern) evidenced 
a relatively decreased perception of 
the white cross (or increased percep- 
tion of the black cross) after satiation. 
In Groups B (black satiation figure 
coincident with white cross of test pat- 
tern) and C (white satiating figure co- 
incident with black portion of test 
pattern), however, the results ob- 
tained were not different from what 
would be expected if the satiation 
period had no consistent effects, since 
half the subjects of each group dis- 
played an increased perception of the 
white cross after satiation, and half 
displayed an increased perception of 
the black cross. 

Thus (as is shown in Groups A and 
D), if the satiating figure and the cor- 
responding portion of the reversible 
pattern are of the same ‘color’ (i.e., 
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both are black on a white background, 
or vice versa) that portion of the re- 
versible pattern suffers a relatively 
decreased tendency to be perceived as 
figure, while (as is shown in Groups 
B and C) satiation has no effects at all 
when the black-white relationships 
of satiating figure and the correspond- 
ing portion of the test pattern are 
reversed. Since Kohler and Wallach 
hold that the only difference obtained 
by substituting a black figure on a 
white ground for a white figure on a 
black ground is a reversal of the di- 
rection of the figural currents in- 
volved (2, p. 331), and that satiation 
is non-directional in nature (2, pp. 
331, 332), the results here obtained are 
in conflict with the process of satiation 
as they describe it. Several ways of 
resolving this difficulty suggest them- 
selves: (a) the effects reported in 
this paper may have no close relation 
to the hypothesized satiational nature 
of figure-ground reversal, but may rep- 
resent the working of some as-yet un- 
known process; (b) the figural cur- 
rents arising from a black figure on 


TABLE I 


Frequency anp Percents or Sussyects Wuo DisrpLayep IncrEASED PERCEPTION 
or THe Brack Cross AFTER SATIATION UNDER F.acu ConpiTion 


Group 


Test-figure presented before and after satiation 


Inspection figure, satiation 
No. of subjects in group 


No. of subjects displaying increased perception 
of black cross after satiation 


Percent displaying increased perception of black 
cross after satiation 


Difference from so; 


( 


Standard error by null hy 
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white ground and those from a white 
figure on black ground may be differ- 
ent in other ways, as well as opposite 
in direction; (c) there may exist di- 
rectional as well as non-directional 
components of the satiation-induced 
resistance. To this writer, the last 
hypothesis seems the most economical, 
since it would permit both the con- 
tour-repulsion phenomena discovered 
by Kohler and Wallach, and the ef- 
fects of satiation upon figure-ground 
reversal to be described within the 
same system of constructs, as follows. 


If the state of raised resistance occa- 
sioned by satiation-induced non-direc- 
tional electrotonus were accompanied by 
a certain amount of something analo- 
gous to a residual polarization ‘of the 
medium, this component would oppose 
the passage of current in the direction of 
the satiating current and facilitate cur- 
rent flow in the opposing direction. The 
results here obtained might then be ex- 
plicable as follows: where the color (and 
therefore the direction of current flow) of 
the satiating figure and of the subse- 
quently coincident portion of the test 
pattern were the same, perception-as- 
figure of that portion of the test figure 
would be opposed both by the direction- 
less factor and the directional compo- 
nent; where the color (and direction of 
current flow) of the satiating figure and 
of the subsequently coincident portion of 
the test pattern were opposite, percep- 
tion-as-figure of that portion of the test 
pattern would be opposed by the non- 
directional components and facilitated 
by the directional components, so that 
in certain situations there might be no 
net effect (as obtained in Groups B and 
C). 

As evidence against any directional 
attributes of satiation, Kohler and 
Wallach have demonstrated (2, p. 332) 
that figural after-effects (measured at 
some small distance away from the 
figural region) caused by satiation with a 
black figure on a white ground are not 
noticeably greater than those caused by 
alternate presentation of a black figure on 
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a white ground and the same figure white 
on a black ground (here, reversing the 
brightness relationship of figure and 
ground in the inspection figure reverses 
the direction of the satiating current and 
therefore cancels any directional com- 
ponents, so that the absence of any de- 
crease in effectiveness under these cir- 
cumstances is taken as indicating the 
non-directional nature of satiation). If, 
however, the directional effect were a 
small one compared to the non-direc- 
tional component, it might only be 
noticed in situations depending upon as 
delicate a balance of factors as may be 
operative in figure-ground _ reversal 
(where the non-directional components 
of satiation must quickly become very 
nearly equal in the two alternative figural 
regions) and further, might be noticeable 
only in the region of greatest satiation, 
the figural region itself. With such char- 
acteristics, the absence of the directional 
effect from the situations tested by 
Kohler and Wallach would seem under- 
standable. 


SUMMARY 


It has been found that prior sati- 
ation of the visual region upon which 
one of the two alternative figures of an 
ambiguous test pattern is later pre- 
sented (a) results in the relative de- 
crease in perception-as-figure of the 
portion of the test pattern subse- 
quently coincident with that region if 
both inspection figure and coincident 
portion of the test pattern are black 
on white, or both are white on black; 
(b) results in no change in the figure- 
ground reversal of the ambiguous pat- 
tern if the inspection figure is white 
on black and the coincident portion of 
test pattern is black on white, or 
vice versa. The findings described 
under (a) would seem to be in sup- 
port of Kohler’s suggestion that 
figure-ground reversals are due to a 
satiational process, while the absence 
of those effects in (b) would seem ir- 
reconcilable with a completely non- 
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directional satiation process as de- 
scribed by Kohler and Wallach. The 
possibility of a weak directional 
satiational component is suggested. 


(Manuscript received for immediate 
publication August 9, 1950) 
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